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General Description 


| 
| 
The NS32016 is a 32-bit, virtual memory microprocessor 
with a 16-MByte linear address space and a 16-bit external 
| data bus. it has a 32-bit ALU, eight 32-bit general purpose 
registers, an eight-byte prefetch queue, and a slave proces- 
sor interface. The NS32016 is fabricated with National 
Semiconductor's advanced XMOS process, and is fully ob- 
ject code compatible with other Series 32000 processors. 
The Series 32000 instructions set is optimized for modular 
high-level languages (HLL). The set is very symmetric, it has 
a two address format, and it incorporates HLL oriented ad- 
dressing modes. The capabilities of the NS32016 can be 
expanded with the use of the NS32081 floating point unit 
(FPU), and the NS32082 demand-paged virtual memory 
management unit (MMU). Both devices interface to the 
NS32016 as slave processors. The NS32016 is a general 
purpose microprocessor that is ideal for a wide range of 
computational intensive applications. 
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32-bit architecture and implementation 
Virtual memory support 

16-MByte linear address space 

16-bit external data bus 

Powerful instruction set 

— General 2-address capability 

— High degree of symmetry 

— Addressing modes optimized for high-level languages 
Series 32000 slave processor support 
High-speed XMOS™ technology 
48-pin dual-in-line (DIP) package 
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1.0 Product Introduction 


The Series 32000 Microprocessor family is a new genera- 
tion of devices using National’s XMOS and CMOS technolo- 
gies. By combining state-of-the-art MOS technology with a 
very advanced architectural design philosophy, this family 
brings mainframe computer processing power to VLSI proc- 
essors. 

The Series 32000 family supports a variety of system con- 
figurations, extending from a minimum low-cost system to a 
powerful 4 gigabyte system. The architecture provides com- 
plete upward compatibility from one family member to an- 
other. The family consists of a selection of CPUs supported 
by a set of peripherals and slave processors that provide 
sophisticated interrupt and memory management facilities 
as well as high-speed floating-point operations. The archi- 
tectural features of the Series 32000 family are described 
briefly below: 


Powerful Addressing Modes: Nine addressing modes 
available io all instructions are included to access data 
structures efficiently. 


Data Types: The architecture provides for numerous data 
types, such as byte, word, doubieword, and BCD, which may 
be arranged into a wide variety of data structures. 
Symmetric instruction Set: While avoiding special case 
instructions that compilers can’t use, the Series 32000 fami- 
ly incorporates powerful instructions for control operations, 
such as array indexing and external procedure calls, which 
save considerable space and time for compiled code. 
Memory-to-Memory Operations: The Series 32000 CPUs 
represent two-operand machines with each operand ad- 
dressable by all addressing modes. This powerful memory- 
to-memory architecture permits memory locations to be 
treated as registers for all useful operations. This is impor- 
tant for temporary operands as well as for context switch- 
ing. 


2.0 Architectural Description 


2.1 PROGRAMMING MODEL 


The Series 32000 architecture includes 16 registers on the 
NS32016 GPU. 


DEDICATED 


—————_—__—_ 2 -——_____—----— 


PROGRAM COUNTER | PC 


SB 


FRAME POINTER | FP 


SP1 
SP 
INTERRUPT STACK PTR. | SPO 


INTERRUPT BASE | INTBASE 


i 


STATUS MODULE 


Memory Management: Either the NS32382 or ihe 
NS32082 Memory Management Unit may be added to the 
system to provide advanced operating system support func- 
tions, including dynamic address translation, virtual memory 
management, and memory protection. 


Large, Uniform Addressing: The NS32016 has 24-bit ad- 
dress pointers that can address up to 16 megabytes without 
requiring any segmentation; this addressing scheme pro- 
vides flexible memory management without added-on ex- 
pense. 


Modular Software Support: Any software package for the 
Series 32000 family can be developed independent of all 
other packages, without regard to individual addressing. In 
addition, ROM code is totally relocatable and easy to ac- 
cess, which allows a significant reduction in hardware and 
software cost. 

Software Processor Concept: The Series 32000 architec- 
ture allows future expansions of the instruction set that can 
be executed by special slave processors, acting as exten- 
sions to the CPU. This concept of slave processors is 


patibility even for future components because the slave 
hardware is transparent to the software. With future ad- 
vances in semiconductor technology, the slaves can be 
physically integrated on the CPU chip itself. 


To summarize, the architectural features cited above pro- 
vide three primary performance advantages and character- 
istics: 

¢ High-Level Language Support 

Easy Future Growth Path 

® Application Flexibility 


GENERAL 


=—_— — ——— 32 ——_—_______ 
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FIGURE 2-1. The General and Dedicated Registers 


2.1.1 General Purpose Registers 


There are eight registers for meeting high speed general 
storage requirements, such as holding temporary variables 
and addresses. The general purpose registers are free for 
any use by the programmer. They are thirty-two bits in 


length. If a general register is specified for an operand that 
is eight or sixteen bits long, only the low part of the register 
is used; the high part is not referenced or modified. 
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2.0 Architectural Description (continued) 


2.1.2 Dedicated Registers 


The eight dedicated registers of the NS32016 are assigned 
specific functions. 

PC: The PROGRAM COUNTER register is a pointer to the 
first byte of the instruction currently being executed. The PC 
is used to reference memory in the program section. (In the 
NS32016 the upper eight bits of this register are always 
zero.) 


SPO, SP1: The SPO register points to the lowest address of 
the last item stored on the INTERRUPT STACK. This stack 
is normally used only by the operating system. It is used 
primarily for storing temporary data, and holding return infor- 
mation for operating system subroutines and interrupt and 
trap service routines. The SP1 register points to the lowest 
address of the last item stored on the USER STACK. This 
stack is used by normal user programs to hold temporary 
data and subroutine return information. 


in this document, reference is made to the SP register. The 
terms “SP register” or “SP” refer to either SPO or SF1, 
depending on the setting of the S bit in the PSR register. If 
the S bit in the PSR is 0 then SP refers to SPO. If the S bit in 
the PSR is 1 then SP refers to SP1. (in the NS32016 the 
upper eight bits of these registers are always zero.) 


Stacks in the Series 32000 family grow downward in memo- 
ry. A Push operation pre-decrements the Stack Pointer by 
the operand length. A Pop operation post-increments the 
Stack Pointer by the operand length. 


FP: The FRAME POINTER register is used by a procedure 
to access parameters and local variables on the stack. The 
FP register is set up on procedure entry with the ENTER 
instruction and restored on procedure termination with the 
EXIT instruction. 


The frame pointer holds the address in memory occupied by 
the old contents of the frame pointer. (In the NS32016 the 
upper eight bits of this register are always zero.) 
SB: The STATIC BASE register points to the global vari- 
ables of a software module. This register is used to support 
relocatable global variables for software modules. The SB 
register holds the lowest address in memory occupied by 
the global variables of a module. (In the NS32016 the upper 
eight bits of this register are always zero.) 
INTBASE: The INTERRUPT BASE register holds the ad- 
dress of the dispatch table for interrupts and traps (Section 
3.8). The INTBASE register holds the lowest address in 
memory occupied by the dispatch table: (In the NS32016 
the upper eight bits of this register are always zero.) 
MOD: The MODULE register holds the address of the mod- 
ule descriptor of the currently executing software module. 
The MOD register is sixteen bits long, therefore the module 
table must be contained within the first 64K bytes of memo- 
ry. 
PSR: The PROCESSOR STATUS REGISTER (PSR) holds 
the status codes for the NS32016 microprocessor. 
The PSR is sixteen bits long, divided into two eight-bit 
halves. The low order eight bits are accessible to all pro- 
grams, but the high order eight bits are accessible only to 
programs executing in Supervisor Mode. 

15 8|7 0 
PPD 'Teistulniz ie pX<bd cf rfc} 
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FIGURE 2-2. Processor Status Register 
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C: The C bit indicates that a carry or borrow occurred after 
an addition or subtraction instruction. It can be used with the 
ADDC and SUBC instructions to perform multiple-precision 
integer arithmetic calculations. It may have a setting of 0 (no 
carry or borrow) or 1 (carry or borrow). 


T: The T bit causes program tracing. If this bit is a 1, a TRC 
trap is executed after every instruction (Section 3.8.5). 


L: The L bit is altered by comparison instructions. In a com- 
parison instruction the L bit is set to “1” if the second oper- 
and is less than the first operand, when both operands are 
interpreted as unsigned integers. Otherwise, it is set to “0”. 
In Floating Point comparisons, this bit is always cleared. 


F: The F bit is a general condition flag, which is altered by 
many instructions (e.g., integer arithmetic instructions use it 
to indicate overflow). 


Z: The Z bit is altered by comparison instructions. In a com- 
parison instruction the Z bit is set to “1” if the second oper- 
and is equal to the first operand; otherwise it is set to “O”. 


N: The N bit is altered by comparison instructions. In a com- 
parison instruction the N bit is set to “1” if the second oper- 
and is less than the first operand, when both operands are 
interpreted as signed integers. Otherwise, it is set to “0”. 


U: If the U bit is “1” no privileged instructions may be exe- 
cuted. If the U bit is ‘‘O” then all instructions may be execut- 
ed. When U=0 the NS32016 is said to be in Supervisor 
Mode; when U= 1 the NS32016 is said to be in User Mode. 
A User Mode program is restricted from executing certain 
instructions and accessing certain registers which could in- 
terfere with the operating system. For example, a User 
Mode program is prevented from changing the setting of the 
flag used to indicate its own privilege mode. A Supervisor 
Mode program is assumed to be a trusted part of the oper- 
ating system, hence it has no such restrictions. 


5: The S bit specifies whether the SPO register or SP7 regis- 
ter is used as the stack pointer. The bit is automatically 
cleared on interrupts and traps. It may have a setting of 0 
(use the SPO register) or 1 (use the SP1 register). 


P: The P bit prevents a TRC trap from occurring more than 
once for an instruction (Section 3.8.5). It may have a setting 
of 0 (no trace pending) or 1 (trace pending). 

I: !f 1=1, then all interrupts will be accepted (Section 3.8). If 
1=0, only the NMI interrupt is accepted. Trap enables are 
not affected by this bit. 


2.1.3 The Configuration Register (CFG) 


Within the Control section of the NS32016 CPU is the four- 
bit CFG Register, which declares the presence of certain 
external devices. It is referenced by only one instruction, 
SETCFG, which is intended to be executed only as part of 
system initialization after reset. The format of the CFG Reg- 
ister is shown in Figure 2-3. 


po [mr |i | 


FIGURE 2-3. CFG Register 


The CFG | bit declares the presence of external interrupt 
vectoring circuitry (specifically, the NS32202 Interrupt Con- 
trol Unit). If the CFG | bit is set, interrupts requested through 
the INT pin are “Vectored.” If it is clear, these interrupts are 
“Non-Vectored.” See Section 3.8. 


The F, M and C bits declare the presence of the FPU, MMU 
and Custom Slave Processors. If these bits are not set, the 
corresponding instructions are trapped as being undefined. 





| 2.0 Architectural Description (continued) | 


The main memory of the NS32016 is a uniform linear ad- 
dress space. Memory locations are numbered sequentially 
starting at zero and ending at 224 — 1. The number specify- 
ing a memory location is called an address. The contents of 
each memory location is a byte consisting of eight bits. Un- 
less otherwise noted, diagrams in this document show data 
stored in memory with the lowest address on the right and 
the highest address on the left. Also, when data is shown 
vertically, the lowest address is at the top of a diagram and 
the highest address at the bottom of the diagram. When bits 
are numbered in a diagram, the least significant bit is given 
the number zero, and is shown at the right of the diagram. 
Bits are numbered in increasing significance and toward the 


left. 


A 
Byte at Address A 


Two contiguous bytes are called a word. Except where not- 
ed (Section 2.2.1), the least significant byte of a word is 
stored at the lower address, and the most significant byte of 
the word is stored at the next higher address. In memory, 
the address of a word is the address of its least significant 
byte, and a word may start at any address. 


15 MSB’s 8/7 LSB’s 0 


A+1 A 
Word at Address A 


Two contiguous words are called a double word. Except 
where noted (Section 2.2.1), the least significant word of a 
double word is stored at the lowest address and the most 
significant word of the double word is stored at the address 
two greater. In memory, the address of a double word is the 
address of its least significant byte, and a double word may 
start at any address. 


2.1.4 Memory Organization 


16 0 
8 









| woo! 
31 
sens 
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FIGURE 2-4. Module Descriptor Format 
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At+3 A+2 A+1 A 
Double Word at Address A 


Although memory is addressed as bytes, it is actually orga- 
nized as words. Therefore, words and double words that are 
aligned to start at even addresses (multiples of two) are 
accessed more quickly than words and double words that 
are not so aligned. 


2.1.5 Dedicated Tables 


Two of the NS32016 dedicated registers (MOD and INT- 
BASE) serve as pointers to dedicated tables in memory. 


The INTBASE register points to the Interrupt Dispatch and 
Cascade tables. These are described in Section 3.8. 


The MOD register contains a pointer into the Module Table, 
whose entries are called Module Descriptors. A Module De- 
scriptor contains four pointers, three of which are used by 
the NS32016. The MOD register contains the address of the 
Module Descriptor for the currently running module. It is au- 
tomatically updated by the Call External Procedure instruc- 
tions (CXP and CXPD). 

The format of a Module Descriptor is shown in Figure 2-4. 
The Static Base entry contains the address of static data 
assigned to the running module. It is loaded into the CPU 
Static Base register by the CXP and CXPD instructions. The 
Program Base entry contains the address of the first byte of 
instruction code in the module. Since a module may have 
multiple entry points, the Program Base pointer serves only 
as a reference to find them. 


The Link Table Address points to the Link Table for the 
currently running module. The Link Table provides the infor- 
mation needed for: 


1) Sharing variables between modules. Such variables 
are accessed through the Link Table via the External 
addressing mode. 


ENTRY [31 


0 
ABSOLUTE ADDRESS 
ae Te 
TL/EE/5054-5 


FIGURE 2-5. A Sample Link Table 






(VARIABLE) 






(VARIABLE) 






(PROCEDURE) 
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FIGURE 2-6. General Instruction Format 
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2.0 Architectural Description (Continued) 


2) Transferring contro! from one module to another. This 
is done via the Call External Procedure (CXP) instruc- 
tion. 

The format of a Link Table is given in Figure 2-5. A Link 
Table Entry for an external variable contains the 32-bit ad- 
dress of that variable. An entry for an external procedure 
contains two 16-bit fields: Module and Offset. The Module 
field contains the new MOD register contents for the mod- 
ule being entered. The Offset field is an unsigned number 
giving the position of the entry point relative to the new 
module’s Program Base pointer. 


For further details of the functions of these tables, see the 
Series 32000 Instruction Set Reference Manual. 


2.2 INSTRUCTION SET 


2.2.1 General Instruction Format 


Figure 2-6 shows the general format of a Series 32000 in- 
struction. The Basic Instruction is one to three bytes long 
and contains the Opcode and up to 5-bit General Address- 
ing Mode (“Gen”) fields. Following the Basic Instruction 
field is a set of optional extensions, which may appear de- 
pending on the instruction and the addressing modes se- 
lected. 


Index Bytes appear when either or both Gen fields specify 
Scaled Index. In this case, the Gen field specifies only the 
Scale Factor (1, 2, 4 or 8), and the Index Byte specifies 
which General Purpose Register to use as the index, and 
which addressing mode calculation to perform before index- 
ing. See Figure 2-7. 


TL/EE/5054-7 
FIGURE 2-7. Index Byte Format 


Following Index Bytes come any displacements (addressing 
constants) or immediate values associated with the select- 
ed addressing modes. Each Disp/Imm field may contain 
one of two displacements, or one immediate value. The size 
of a Displacement field is encoded within the top bits of that 
field, as shown in Figure 2-8, with the remaining bits inter- 
preted as a signed (two’s complement) value. The size of an 
immediate value is determined from the Opcode field. Both 
Displacement and Immediate fields are stored most-signifi- 
cant byte first. Note that this is different from the memory 
representation of data (Section 2.1.4). 


Some instructions require additional “implied” immediates 
and/or displacements, apart from those associated with ad- 
dressing modes. Any such extensions appear at the end of 
the instruction, in the order that they appear within the list of 
operands in the instruction definition (Section 2.2.3). 


2.2.2 Addressing Modes 


The NS32016 CPU generally accesses an operand by cal- 
culating its Effective Address based on information avail- 
able when the operand is to be accessed. The method to be 
used in performing this calculation is specified by the pro- 
grammer as an “addressing mode.” 

Addressing rnodes in the NS32016 are designed to optimal- 
ly support high-level language accesses to variables. In 
nearly all cases, a variable access requires only one ad- 
dressing mode, within the instruction that acts upon that 
variable. Extraneous data movement is therefore minimized. 
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NS32016 Addressing Modes fall into nine basic types: 


Register: The operand is available in one of the eight Gen- 
eral Purpose Registers. In certain Slave Processor instruc- 
tions, an auxiliary set of eight registers may be referenced 
instead. 


Register Relative: A General Purpose Register contains an 
address to which is added a displacement value from the 
instruction, yielding the Effective Address of the operand in 
memory. 
Memory Space: Identical to Register Relative above, ex- 
cept that the register used is one of the dedicated registers 
PC, SP, SB or FP. These registers point to data areas gen- 
erally needed by high-level languages. 

Byte Displacement: Range — 64 to + 63 

7 oe 


Po] seeonsnacme 


Word Displacement: Range —8192 to + 8191 





Double Word Displacement: 
Range (Entire Addressing Space) 


TL/EE/5054-10 
FIGURE 2-8. Displacement Encodings 


Memory Relative: A pointer variable is found within the 
memory space pointed to by the SP, SB or FP register. A 
displacement is added to that pointer to generate the Effec- 
tive Address of the operand. 


immediate: The operand is encoded within the instruction. 
This addressing mode is not allowed if the operand is to be 
written. 


Absolute: The address of the operand is specified by a 
displacement field in the instruction. 


External: A pointer value is read from a specified entry of 
the current Link Table. To this pointer value is added a dis- 
placement, yielding the Effective Address of the operand. 


Top of Stack: The currently-selected Stack Pointer (SPO or 
SP1) specifies the location of the operand. The operand is 
pushed or popped, depending on whether it is written or 
read. 


Scaled Index: Although encoded as an addressing mode, 
Scaled Indexing is an option on any addressing mode ex- 
cept Immediate or another Scaled Index. It has the effect of 
calculating an Effective Address, then multiplying any Gen- 
eral Purpose Register by 1, 2, 4 or 8 and adding into the 
total, yielding the final Effective Address of the operand. 

Table 2-1 is a brief summary of the addressing modes. For a 


complete description of their actions, see the Series 32000 
Instruction Set Reference Manual. 
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ENCODING 
Register 
00000 
00001 
00010 
00011 
00100 
00101 
00110 
00111 


Register Relative 


01000 
01001 
01010 
01011 
01100 
01101 
01110 
01111 


Memory Relative 


10000 
10001 
10010 


Reserved 
10011 
Immediate 
10100 


Absolute 
10101 
External 
10110 


Top Of Stack 
10111 


Memory Space 
11000 

11001 

11010 

11014 

Scaled Index 
11100 

11101 

11110 

11111 


TABLE 2-1 


NS32016 Addressing Modes 


MODE 


Register 0 
Register 1 
Register 2 
Register 3 
Register 4 
Register 5 
Register 6 
Register 7 


Register 0 relative 
Register 1 relative 
Register 2 relative 
Register 3 relative 
Register 4 relative 
Register 5 relative 
Register 6 relative 
Register 7 relative 


Frame memory relative 


Stack memory relative 
Static memory relative 


(Reserved for Future Use) 


Immediate 


Absolute 


External 


Top of stack 


Frame memory 
Stack memory 
Static memory 
Program memory 


Index, bytes 

Index, words 

Index, double words 
Index, quad words 


ASSEMBLER SYNTAX 


RO or FO 
Ri or F1 
R2 or F2 
R3 or F3 
R4 or F4 
R5 or F5 
R6 or F6 
R6 or F7 


disp(RO) 
disp(R1) 
disp(R2) 
disp(R3) 
disp(R4) 
disp(R5) 
disp(R6) 
disp(R7) 


disp2(disp1 (FP)) 
disp2(disp1 (SP)) 
disp2(disp1 (SB)) 


value 


@disp 


EXT (disp1) + disp2 


TOS 


disp(FP) 
disp(SP) 
disp(SB) 
*+ disp 


mode[Rn:B] 
mode[Rn:W] 
mode[Rn:D] 
mode[Rn:Q] 


EFFECTIVE ADDRESS 


None: Operand is in the specified 
register. 


Disp + Register. 


Disp2 + Pointer; Pointer found at 
address Disp 1 + Register. “SP” 
is either SPO or SP1, as selected 

in PSR. 


None: Operand is input from 
instruction queue. 


Disp. 


Disp2 + Pointer; Pointer is found 
at Link Table Entry number Disp1. 


Top of current stack, using either 
User or Interrupt Stack Pointer, 
as selected in PSR. Automatic 
Push/Pop included. 


Disp + Register; “SP” is either 
SPO or SP1, as selected in PSR. 


EA (mode) + Rn. 

EA (mode) + 2XRn. 

EA (mode) + 4XRn. 

EA (mode) + 8XRn. 

“Mode” and “n” are contained 
within the Index Byte. 

EA (mode) denotes the effective 
address generated using mode. 
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2.0 Architectural Description (continued) 


2.2.3 Instruction Set Summary 

Table 2-2 presents a brief description of the NS32016 in- 
struction set. The Format column refers to the Instruction 
Format tables (Appendix A). The Instruction column gives 
the instruction as coded in assembly language, and the De- 
scription column provides a short description of the function 
provided by that instruction. Further details of the exact op- 
erations performed by each instruction may be found in the 
Series 32000 Instruction Set Reference Manual. 


Notations: 
i= Integer length suffix: B = Byte 

W= Word 

D = Double Word 
f= Floating Point length suffix: F= Standard Floating 

L= Long Floating 

gen= General operand. Any addressing mode can be speci- 
fied. 


short=A 4-bit value encoded within the Basic Instruction 
(see Appendix A for encodings). 


imm= Implied immediate operand. An 8-bit value appended 
after any addressing extensions. 


disp = Displacement (addressing constant): 8, 16 or 32 bits. 
All three lengths legal. 


reg= Any General Purpose Register: RO-R7. 


areg= Any Dedicated/Address Register: SP, SB, FP, MOD, 
INTBASE, PSR, US (bottom 8 PSR bits). 


mreg= Any Memory Management Status/Control Register. 
creg= A Custom Slave Processor Register (Implementation 
Dependent). 

cond=Any condition code, encoded as a 4-bit field within 
the Basic Instruction (see Appendix A for encodings). 


TABLE 2-2 
NS32016 Instruction Set Summary 


MOVES 


Format Operation Operands 


MOVi gen,gen 
MOVQi short,gen 
MOVMi gen,gen,disp 
MOVZBW gen,gen 
MOVZiD gen,gen 
MOVXBW gen,gen 
MOVXiD gen,gen 
ADDR gen,gen 


INTEGER ARITHMETIC 


Format Operation Operands 


bh 


ADDOQi short,gen 
ADDCi gen,gen 
SUBi gen,gen 
SUBCi gen,gen 
NEGi gen,gen 
ABSi gen,gen 
MULIi gen,gen 
QUOI gen,gen 
REMi gen,gen 
DIVi gen,gen 
MODi gen,gen 
MEli gen,gen 
DEli gen,gen 


a NNN NNO OO A FP SH Pf 


~! 


PACKED DECIMAL (BCD) ARITHMETIC 


Format Operation Operands 


6 ADDPi 
6 SUBPi 


gen,gen 
gen,gen 


ADDi gen,gen Add. 

Add signed 4-bit constant. 
Add with carry. 

Subtract. 

Subtract with carry (borrow). 
Negate (2’s complement). 
Take absolute value. 
Multiply. 

Divide, rounding toward zero. 
Remainder from QUO. 
Divide, rounding down. 
Remainder from DIV (Modulus). 
Multiply to extended integer. 
Divide extended integer. 


Description 


Move a value. 

Extend and move a signed 4-bit constant. 
Move multiple: disp bytes (1 to 16). 

Move with zero extension. 

Move with zero extension. 

Move with sign extension. 

Move with sign extension. 

Move effective address. 


Description 


Description 


Add packed. 
Subtract packed. 
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TABLE 2-2 
NS32016 Instruction Set Summary (Continued) 


INTEGER COMPARISON 
Format Operation Operands Description 
4 CMPi gen,gen Compare. 
2 CMPQi short,gen Compare to signed 4-bit constant. 
7 CMPMi gen,gen,disp Compare multiple: disp bytes (1 to 16). 
LOGICAL AND BOOLEAN 
Format Operation Operands Description 
4 ANDi gen,gen Logical AND. 
4 ORi gen,gen Logical OR. 
4 BICi gen,gen Clear selected bits. 
4 XORi gen,gen Logical exclusive OR. 
6 COMi gen,gen Complemeni all bits. 
6 NOTi gen,gen Boolean complement: LSB only. 
2 Scondi gen Save condition code (cond) as a Boolean variable of size i. 
SHIFTS 
Format Operation Operands Description 
6 LSHi gen,gen Logical shift, left or right. 
6 ASHi gen,gen Arithmetic shift, left or right. 
6 ROTI gen,gen Rotate, left or right. 
BITS 
Format Operation Operands Description 
4 TBITi gen,gen Test bit. 
6 SBITi gen,gen Test and set bit. 
6 SBITIi gen,gen Test and set bit, interlocked. 
6 CBITi gen,gen Test and clear bit. 
6 CBITH gen,gen Test and clear bit, interlocked. 
6 IBITi gen,gen Test and invert bit. 
8 FFSi gen,gen Find first set bit. 
BIT FIELDS 


Bit fields are values in memory that are not aligned to byte boundaries. Examples are PACKED arrays and records used in 
Pascal. “Extract” instructions read and align a bit field. “Insert” instructions write a bit field from an aligned source. 


Format Operation Operands Description 
8 EXTi reg,gen,gen,disp Extract bit field (array oriented). 
8 INSi reg,gen,gen, disp Insert bit field (array oriented). 
7 EXTSi gen,gen,imm,imm Extract bit field (short form). 
7 INSSi gen,gen,imm,imm Insert bit field (short form). 
8 CVTP reg,gen,gen Convert to bit field pointer. 

ARRAYS 

Format Operation Operands Description 
8 CHECKi reg,gen,gen Index bounds check. 
8 INDEXi reg,gen,gen Recursive indexing step for multiple-dimensional arrays. 
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2.0 Architectural Description (Continued) 


STRINGS 


String instructions assign specific functions to the General 
Purpose Registers: 


R4 — Comparison Value 


R3 — Translation Table Pointer 


R2 — String 2 Pointer 
R1 — String 1 Pointer 


RO — Limit Count 


Format 


Operation 


MOVSi 
MOVST 
CMPSi 
CMPST 
SKPSi 
SKPST 


JUMPS AND LINKAGE 


Format 


3 
0 
0 
3 
2 
3 
1 
1 
3 
1 
1 
1 
1 
1 
1 
1 
1 
1 


CPU REGISTER MANIPULATION 


Format 


anand nhb = — 


Operation 


JUMP 
BR 
Bcond 
CASEi 
ACBi 
JSR 
BSR 
CXP 
CXPD 
SVC 
FLAG 
BPT 
ENTER 
EXIT 
RET 
RXP 
RETT 
RET! 


Operation 


SAVE 
RESTORE 
LPRi 

SPRi 
ADJSPi 
BISPSRi 
BICPSRi 
SETCFG 


TABLE 2-2 


NS32016 Instruction Set Summary (Continued) 


Operands 


options 
options 
options 
options 
options 
options 


Operands 


gen 
disp 
disp 
gen 
short,gen,disp 
gen 
disp 
disp 
gen 


[reg list], disp 
{reg list] 

disp 

disp 

disp 


Operands 


[reg list] 
[reg list] 
areg,gen 
areg,gen 
gen 

gen 

gen 
[option list] 


Options on all string instructions are: 


B (Backward): Decrement strong pointers after each 
step rather than incrementing. 


End instruction if String 1 entry matches 
R4. 


W (While match): End instruction if String 1 entry does not 
match Ré4. 


All string instructions end when RO decrements to zero. 


U (Unti! match): 


Description 


Move string 1 to string 2. 

Move string, translating bytes. 
Compare string 1 to string 2. 
Compare, translating string 1 bytes. 
Skip over string 1 entries. 

Skip, translating bytes for until/while. 


Description 


Jump. 

Branch (PC Relative). 

Conditional branch. 

Multiway branch. 

Add 4-bit constant and branch if non-zero. 

Jump to subroutine. 

Branch to subroutine. 

Call external procedure 

Cail external procedure using descriptor. 

Supervisor call. 

Fiag trap. 

Breakpoint trap. 

Save registers and allocate stack frame (Enter Procedure). 
Restore registers and reclaim stack frame (Exit Procedure). 
Return from subroutine. 

Return from external procedure call. 

Return from trap. (Privileged) 

Return from interrupt. (Privileged) 


Description 


Save general purpose registers. 

Restore general purpose registers. 

Load dedicated register. (Privileged if PSR or INTBASE) 
Store dedicated register. (Privileged if PSR or INTBASE) 
Adjust stack pointer. 

Set selected bits in PSR. (Privileged if not Byte length) 
Clear selected bits in PSR. (Privileged if not Byte length) 
Set configuration register. (Privileged) 





9-986 








, = 
| 2.0 Architectural Description (Continued) | 8 
TABLE 2-2 8 
NS32016 Instruction Set Summary (Continued) > 
FLOATING POINT o 
Format Operation Operands Description > 
11 MOVE gen,gen Move a floating point value. ” 
9 MOVLF gen,gen Move and shorten a long value to standard. 3 
9 MOVFL gen,gen Move and lengthen a standard value to long. 2° 
| 9 MOVif gen,gen Convert any integer to standard or tong floating. > 
9 ROUNDfi gen,gen Convert to integer by rounding. & 
| 9 TRUNCfi gen,gen Convert to integer by truncating, toward zero. S 
H 9 FLOORS gen,gen Convert to largest integer less than or equal to value. Go 
| 11 ADDf gen,gen Add. Ss 
11 SUBf gen,gen Subtract. a 
11 MULf gen,gen Multiply. _ 
11 Divi gen,gen Divide. o 
11 CMPf gen,gen Compare. 
11 NEGf gen,gen Negate. 
11 ABSf gen,gen Take absolute value. 
9 LFSR gen Load FSR. 
! 9 SFSR gen Store FSR. 
| MEMORY MANAGEMENT 
Format Operation Operands Description 
14 LMR mreg,gen Load memory management register. (Privileged) 
14 SMR mreg,gen Store memory management register. (Privileged) 
14 RDVAL gen Validate address for reading. (Privileged) 
14 WRVAL gen Validate address for writing. (Privileged) 
| 8 MOVSUi gen,gen Move a value from supervisor 
space to user space. (Privileged) 
| 8 MOVUSi gen,gen Move a value from user space 
| to supervisor space. (Privileged) 
| MISCELLANEOUS 
Format Operation Operands Description 
1 NOP No operation. 
1 WAIT Wait for interrupt. 
1 DIA Diagnose. Single-byte ‘‘Branch to Self” for hardware 
breakpointing. Not for use in programming. 
CUSTOM SLAVE 
Format Operation Operands Description 
15.5 CCALOc gen,gen Custom calculate. 
15.5 CCALIc gen,gen 
15.5 CCAL2c gen,gen 
16.5 CCAL3c gen,gen 
15.5 CMOV0c gen,gen Custom move. 
15.5 CMOVic gen,gen 
15.5 CMOV2c gen,gen 
CMOV3c gen,gen 
15.5 CCMPc gen,gen Custom compare. 
CCMPtc gen,gen 
15.1 CCVO0ci gen,gen Custom convert. 
15.1 CCV Ici gen,gen 
15.1 CCV2ci gen,gen 
15.1 CCV3ic gen,gen 
15.1 CCV4DQ gen,gen 
15.1 CCV5QD gen,gen 
15.1 LCSR gen Load custom status register. 
15.1 SCSR gen Store custom status register. 
15.0 CATSTO gen Custom address/test. (Privileged) 
15.0 CATST1 gen (Privileged) 
15.0 LCR creg,gen Load custom register. (Privileged) 
15.0 SCR creg,gen Store custom register. (Privileged) 
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3.0 Functional Description 


3.1 POWER AND GROUNDING 


The NS32016 requires a single 5-volt power supply, applied 

on pin 48 (Vcc). 

Grounding connections are made on two pins. Logic Ground 

(GNDL, pin 24) is the common pin for on-chip logic, and 

Buffer Ground (GNDB, pin 25) is the common pin for the 

output drivers. For optimal noise immunity, it is recommend- 

ed that GNDL be attached through a single conductor di- 

rectly to GNDB, and that all other grounding connections be 

made only to GNDB, as shown below (Figure 3-7). 

In addition to Voc and Ground, the NS32016 CPU uses an 

internally-generated negative voltage. It is necessary to filter 

this voltage externally by attaching a pair of capacitors (Fig- 

ure 3-1) from the BBG pin to ground. Recommended values 

of these are: 

Cy: 1 wF, Tantalum. 

Co: 1000 pF, low inductance. This should be either a disc or 
monolithic ceramic capacitor. 


Oo +5V 


Vee [48] 


OTHER GROUND 
CONNECTIONS 





TL/EE/5054-11 
FIGURE 3-1. Recommended Supply Connections 


3.2 CLOCKING 

The NS32016 inputs clocking signals from the NS32201 
Timing Control Unit (TCU), which presents two non-overlap- 
ping phases of a single clock frequency. These phases are 
called PHH (pin 26) and PHI2 (pin 27). Their relationship to 
each other is shown in Figure 3-2. 


Vec 


Each rising edge of PHI1 defines a transition in the timing 
state (‘T-State”) of the CPU. One T-State represents the 
execution of one microinstruction within the CPU, and/or 
one step of an external bus transfer. See Section 4 for com- 
plete specifications of PHI1 and PHI2. 


ONE T-STATE = 


NON-OVERLAPPING 








TL/EE/5054-12 
FIGURE 3-2. Clock Timing Relationships 


As the TCU presents signals with very fast transitions, it is 
recommended that the conductors carrying PHI1 and PHI2 
be kept as short as possible, and that they not be connect- 
ed anywhere except from the TCU to the CPU and, if pres- 
ent, the MMU. A TTL Clock signal (CTTL) is provided by the 
TCU for all other clocking. 


3.3 RESETTING 


The RST/ABT pin serves both as a Reset for on-chip logic 
and as the Abort input for Memory-Managed systems. For 
its use as the Abort Command, see Section 3.5.4. 


The CPU may be reset at any time by pulling the RST/ABT 
pin low for at least 64 clock cycles. Upon detecting a reset, 
the CPU terminates instruction processing, resets its inter- 
nal logic, and clears the Program Counter (PC) and Proces- 
sor Status Register (PSR) to all zeroes. 

On application of power, RST/ABT must be held low for at 
least 50 ps after Vcc is stable. This is to ensure that all on- 
chip voltages are completely stable before operation. 
Whenever a Reset is applied, it must also remain active 


-————_—_- 





— 


_ 


2 64 CLOCK 
CYCLES 


AST/ABT —__;—_______+ 


250 .sec —— —-—-- -- 
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FIGURE 3-3. Power-On Reset Requirements 











3.0 Functional Description (continue) 


for not iess than 64 clock cycles. The rising edge must oc- 
cur while PHI1 is high. See Figures 3-3 and 3-4. 

The NS32201 Timing Control Unit (TCU) provides circuitry 
to meet the Reset requirements of the NS32016 CPU. Fig- 
ure 3-5a shows the recommended connections for a non- 
Memory-Managed system. Figure 3-5b shows the connec- 
tions for a Memory-Managed system. 


















|} = 64 CLOCK —.--| 
CYCLES 


RST/ABT 
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FIGURE 3-4. General Reset Timing 


NS32201 
TCU 







ASTI RSTO 
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SYSTEM RESET 
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FIGURE 3-5a. Recommended Reset Connections, Non-Memory-Managed System 





NS32016 
CPU 





RSTI RSTO RST/ABT 
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FIGURE 3-5b. Recommended Reset Connections, Memory-Managed System 


pro----------- 

r---4 

| RES 1 1 

H I 1 

ido | 

Lew we wee eee ee 

EXTERNAL RESET ? Ts 
PTIONAL) H + £ 

Louw 
RESET SWITCH 
(OPTIONAL) 

pe--n---- He 

| 

j 

| RESET ! , 

i 

| 

ae 140} 

EXTERNAL RESET 1 
(OPTIONAL) H = = 
Lod 
RESET SWITCH 
(OPTIONAL) 
3.4 BUS CYCLES 


The NS32016 CPU has a strap option which defines the Bus 
Timing Mode as either With or Without Address Translation. 
This section describes only bus cycles under the No Ad- 
dress Translation option. For details of the use of the strap 
and of bus cycles with address translation, see Section 3.5. 


The CPU will perform a bus cycle for one of the following 

reasons: 

1) To write or read data, to or from memory or a peripheral 
interface device. Peripheral input and output are memo- 
ry-mapped in the Series 32000 family. 

2) To fetch instructions into the eight-byte instruction 
queue. This happens whenever the bus would otherwise 
be idle and the queue is not already full. 


3) To acknowledge an interrupt and allow external circuitry 
to provide a vector number, or to acknowledge comple- 
tion of an interrupt service routine. 


4) To transfer information to or from a Slave Processor. 


In terms of bus timing, cases 1 through 3 above are identi- 
cal. For timing specifications, see Section 4. The only exter- 
nal difference between them is the four-bit code placed on 
the Bus Status pins (STO-ST3). Slave Processor cycles dif- 
fer in that separate control signals are applied (Section 
3.4.6). 


The sequence of events in a non-Slave bus cycle is shown 
in Figure 3-7 for a Read cycle and Figure 3-8 for a Write 
cycle. The cases shown assume that the selected memory 
or interface device is capable of communicating with the 
CPU at full speed. If it is not, then cycle extension may be 
requested through the RDY line (Section 3.4.1). 
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3.0 Functional Description (Continued) 


A full-speed bus cycle is performed in four cycles of the 
PHI1 clock signal, labeled T1 through T4. Clock cycles not 
associated with a bus cycle are designated Ti (for “Idle”). 


During T1, the CPU applies an address on pins ADO-AD15 
and A16-A23. It also provides a low-going pulse on the 
ADS pin, which serves the dual purpose of informing exter- 
nal circuitry that a bus cycle is starting and of providing con- 
trol to an external latch for demultiplexing Address bits 0- 
15 from the ADO-AD15 pins. See Figure 3-6. During this 
time also the status signals DDIN, indicating the direction of 
the transfer, and HBE, indicating whether the high byte 
(AD8—AD15) is to be referenced, become valid. 


During T2 the CPU switches the Data Bus, ADO-AD15, to 
either accept or present data. Note that the signals A16- 
A23 remain valid, and need not be latched. It also starts the 
data strobe (DS), signaling the beginning of the data trans- 
fer. Associated signals from the NS32201 Timing Control 
Unit are also activated at this time: RD (Read Strobe) or WR 
(Write Strobe), TSO (Timing State Output, indicating that T2 
has been reached) and DBE (Data Buffer Enable). 


ADO-ADI5 


NS32016 





A16-A23 
PHI1 PHI2  OS/FLT 


PHI1  PHI2 ADS DDIN 


NS32201 


The T3 state provides for access time requirements, and it 
occurs at least once in a bus cycle. At the end of T2 or T3, 
on the falling edge of the PHI2 clock, the RDY (Ready) line 
is sampled to determine whether the bus cycle will be ex- 
tended (Section 3.4.1). 


If the CPU is performing a Read cycle, the Data Bus (ADO- 
AD15) is sampled at the falling edge of PHI2 of the last T3 
state. See Section 4. Data must, however, be held at least 
until the beginning of T4. DS and RD are guaranteed not to 
go inactive before this point, so the rising edge of either of 
them may safely be used to disable the device providing the 
input data. 


The T4 state finishes the bus cycle. At the beginning of T4, 
the DS, RD, or WR, and TSO signals go inactive, and at the 
rising edge of PHI2, DBE goes inactive, having provided for 
necessary data hold times. Data during Write cycles re- 
mains valid from the CPU throughout T4. Note that the Bus 
Status lines (STO-ST3) change at the beginning of T4, an- 
ticipating the following bus cycle (if any). 


AO(LBE) 
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FIGURE 3-6. Bus Connections 
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NS32016 CPU BUS SIGNALS 
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FIGURE 3-7. Read Cycle Timing 
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3.0 Functional Description (continued) 


NS32016 CPU BUS SIGNALS 
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FIGURE 3-8, Write Cycle Timing 
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3.4.1 Cycle Extension 

To allow sufficient strobe widths and access times for any 
speed of memory or peripheral device, the NS32016 pro- 
vides for extension of a bus cycle. Any type of bus cycle 
except a Slave Processor cycle can be extended. 


In Figures 3-7 and 3-8, note that during T3 all bus control 
signals from the CPU and TCU are flat. Therefore, a bus 
| cycle can be cleanly extended by causing the T3 state to be 
| repeated. This is the purpose of the RDY (Ready) pin. 
' At the end of T2 on the falling edge of PHI2, the RDY line is 
sampled by the CPU. If RDY is high, the next T-states will be 
T3 and then T4, ending the bus cycle. If it is sampled low, 
then another T3 state will be inserted after the next T-state 
and the RDY line will again be sampled on the falling edge 
of PHI2. Each additional T3 state after the first is referred to 
as a “wait state.” See Figure 3-9. 


PHI 2 


RDY 








| 13 


NEXT 
STATE: 
13 





The RDY pin is driven by the NS32201 Timing Control Unit, 
which applies WAIT States to the CPU as requested on 
three sets of pins: 

1) CWAIT (Continues WAIT), which holds the CPU in WAIT 
States until removed. 

2) WAIT1, WAIT2, WAIT4, WAIT8 (Collectively WAITn), 
which may be given a four-bit binary value requesting a 
specific number of WAIT States from 0 to 15. 

3). PER (Peripheral), which inserts five additional WAIT 
states and causes the TCU to reshape the AD and WR 
strobes. This provides the setup and hold times required 
by most MOS peripherai interface devices. 

Combinations of these various WAIT requests are both legal 

and useful. For details of their use, see the NS32201 TCU 

Data Sheet. 

Figure 3-70 illustrates a typical Read cycle, with two WAIT 

states requested through the TCU WAITn pins. 





T3 
(WAIT) v4 


NEXT 
STATE: 
T4 
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FIGURE 3-9. RDY Pin Timing 


3.4.2 Bus Status 


The NS32016 CPU presents four bits of Bus Status informa- 
tion on pins STO-ST3. The various combinations on these 
pins indicate why the CPU is performing a bus cycle, or, if it 
is idle on the bus, then why it is idle. 


Referring to Figures 3-7 and 3-8, note that Bus Status leads 
the corresponding Bus Cycle, going valid one clock cycle 
before T1, and changing io the next state at T4. This allows 
the system designer to fully decode the Bus Status and, if 
desired, latch the decoded signals before ADS initiates the 
Bus Cycle. 

The Bus Status pins are interpreted as a four-bit value, with 
STO the least significant bit. Their values decode as follows: 


0000 — The bus is idle because the CPU does not need 
to perform a bus access. 


0001— The bus is idle because the CPU is executing 
the WAIT instruction. 

0010 — (Reserved for future use.) 

0011— The bus is idle because the CPU is waiting for a 
Slave Processor to complete an instruction. 

0100 — Interrupt Acknowledge, Master. 


The CPU is performing a Read cycle. To ac- 
knowledge receipt of a Non-Maskable Interrupt 
(on NMI) it will read from address FFFFO04,, 
but will ignore any data provided. 

To acknowledge receipt of a Maskable Interrupt 
(on INT) it will read from address FFFE0046, 
expecting a vector number to be provided from 





9.993 


the Master NS32202 Interrupt Control Unit. If 
the vectoring mode selected by the last 
SETCFG instruction was Non-Vectored, then 
the GPU will ignore the value it has read and will 
use a default vector instead, having assumed 
that no NS32202 is present. See Section 3.4.5. 


0101 — Interrupt Acknowledge, Cascaded. 
The CPU is reading a vector number from a 
Cascaded NS32202 Interrupt Control Unit. The 
address provided is the address of the 
NS32202 Hardware Vector register. See Sec- 
tion 3.4.5. 

0110— End of interrupt, Master. 
The CPU is performing a Read cycle to indicate 
that it is executing a Return from Interrupt 
(RET}) instruction. See Section 3.4.5. 

0111— End of Interrupt, Cascaded. 
The CPU is reading from a Cascaded Interrupt 
Control Unit to indicate that it is returning 
(through RETI) from an interrupt service routine 
requested by that unit. See Section 3.4.5. 

1000-— Sequential Instruction Fetch. 


The CPU is reading the next sequential word 
from the Instruction siream into the Instruction 
Queue. It will do so whenever the bus would 
otherwise be idle and the quaue is not already 
full. 
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3.0 Functional Description (Continued) 


NS32016 CPU BUS SIGNALS 
PREV. CYCLE NEXT CYCLE 
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NS32201 TCU CYCLE EXTENSION SIGNALS 
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pov [ 


(TCU TO CPU) 





NS32201 TCU BUS SIGNALS 
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FIGURE 3-10. Extended Cycle Example 


Note: Arrows on CWAIT, PER, WAITn indicate points at which the TCU samples. Arrows on ADO-AD15 and 
RDY indicate points at which the CPU samples. 
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| 3.0 Functional Description (continued) | 


1001— Non-Sequential Instruction Fetch. 

The CPU is performing the first fetch of instruc- 
tion code after the Instruction Queue is purged. 
This will occur as a result of any jump or branch, 
or any interrupt or trap, or execution of certain 
instructions. 

Data Transfer. 

The CPU is reading or writing an operand of an 
instruction. 

Read RMW Operand. 

The CPU is reading an operand which will sub- 
sequently be modified and rewritten. If memory 
protection circuitry would not allow the following 
Write cycle, it must abort this cycle. 

Read for Effective Address Calculation. 

The CPU is reading information from memory in 
order to determine the Effective Address of an 
operand. This will occur whenever an insiruc- 
tion uses the Memory Relative or External ad- 
dressing mode. 

Transfer Slave Processor Operand. 

The CPU is either transferring an instruction op- 
erand to or from a Slave Processor, or it is issu- 
ing the Operation Word of a Slave Processor 
instruction. See Section 3.9.1. 

Read Slave Processor Status. 

The CPU is reading a Status Word from a Slave 
Processor. This occurs after the Slave Proces- 
sor has signalled completion of an instruction. 
The transferred word tells the CPU whether a 
trap should be taken, and in some instructions it 
presents new values for the CPU Processor 
Status Register bits N, Z, L or F. See Section 
3.9.1. 

Broadcast Slave ID. 

The CPU is initiating the execution of a Slave 
Processor instruction. The ID Byte (first byte of 
the instruction) is sent to all Slave Processors, 
one of which will recognize it. From this point 
the CPU is communicating with only one Slave 
Processor. See Section 3.9.1. 


1010 — 


1011 — 


1100 — 


1101 — 


1110 — 


1111 — 


3.4.3 Data Access Sequences 


The 24-bit address provided by the NS32016 is a byte ad- 
dress; that is, it uniquely identifies one of up to 16,777,216 
eight-bit memory locations. An important feature of the 
NS32016 is that the presence of a 16-bit data bus imposes 
no restrictions on data alignment; any data item, regardless 
of size, may be placed starting at any memory address. The 
NS32016 provides a special control signal, High Byte En- 
able (HBE), which facilitates individual byte addressing on a 
16-bit bus. 
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Memory is intended to be organized as two eight-bit banks, 
each bank receiving the word address (A1-—A23) in parallel. 
One bank, connected to Data Bus pins ADO-AD7, is en- 
abled to respond io even byte addresses; i.e., when the 
least significant address bit (AO) is low. The other bank, 
connected to Data Bus pins AD8-AD15, is enabled when 
HBE is low. See Figure 3-11. 





HBE 





16 BITS DATA 
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FIGURE 3-11. Memory Interface 


Any bus cycle falls into one of three categories: Even Byte 
Access, Odd Byte Access, and Even Word Access. All ac- 
cesses to any data type are made up of sequences of these 
cycles. Table 3-1 gives the state of AO and HBE for each 
category. 


TABLE 3-1 
Bus Cycle Categories 
Category HBE AO 
Even Byte 1 0 
Odd Byte 0 1 
Even Word 0 0 


Accesses of operands requiring more than one bus cycle 
are performed sequentially, with no idle T-States separating 
them. The number of bus cycles required to transfer an op- 
erand depends on its size and its alignment (i.e., whether it 
starts on an even byte address or an odd byte address). 
Table 3-2 lists the bus cycle performed for each situation. 
For the timing of AO and HBE, see Section 3.4. 
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3.0 Functional Description (Continued) 


TABLE 3.2 
Access Sequences 
Address HBE AO 


A. Odd Word Access Sequence 


Odd Byte 0 Byte 0 Don’t Care 
Even Byte Don’t Care Byte 1 


B. Even Double-Word Access Sequence 


yre3 | ByTe2 | ByTe1 | BYTEO | 


Even Word A 0 Byte 1 
Even Word A+2 0 Byte 3 


C. Odd Double-Word Access Sequence 


BYTE 3 BYTE 2 BYTE 1 BYTE 0 


Odd Byte A 0 Don’t Care 
Even Word At+1 0 Byte 1 


Even Byte A+3 1 Don’t Care Byte 3 


D. Even Quad-Word Access Sequence 


BYTE 7 BYTE 6 BYTE 5 BYTE 4 BYTE 3 BYTE 2 BYTE 1 BYTE 0 
1 0 0 


Even Word A Byte 0 
2 Even Word At+2 0 Byte 2 


Other bus cycles (instruction prefetch or slave) can occur here. 


3 Even Word A+4 Byte 4 
4 Even Word A+6 Byte 6 


E. Odd Quad-Word Access Sequence 


BYTE 7 BYTE 6 BYTE 5 BYTE 4 BYTE 3 BYTE 2 BYTE 1 BYTE 0 —A 


1 Odd Byte A 0 1 Don’t Care 
2 Even Word A+1 0 0 Byte 1 
3 Even Byte A+3 1 0 Byte 3 


Other bus cycles (instruction prefetch or slave) can occur here. 


Odd Byte A+4 Don’t Care 
Even Word A+5 Byte 5 
Even Byte A+7 Don’t Care Byte 7 
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3.0 Functional / 3.0Functional Descriptioncamey) = tssts~=sw (Continued) 


3.4.3.1 Bit Accesses 

The Bit Instructions perform byte accesses to the byte con- 
taining the designated bit. The Test and Set Bit instruction 
(SBIT), for example, reads a byte, alters it, and rewrites it, 
having changed the contents of one bit. 


3.4.3.2 Bit Field Accesses 


An access to a Bit Field in memory always generates a Dou- 
ble-Word transfer at the address containing the least signifi- 
cant bit of the field. The Double Word is read by an Extract 
instruction; an Insert instruction reads a Double Word, modi- 
fies it, and rewrites it. 


3.4.3.3 Extending Multiply Accesses 


The Extending Multiply Instruction (MEI) will return a result 
which is twice the size in bytes of the operand it reads. If the 
multiplicand is in memory, the most-significant half of the 
result is written first (at the higher address), then the least- 
significant half. This is done in order to support retry if this 
instruction is aborted. 


3.4.4 Instruction Fetches 


Instructions for the NS32016 CPU are “prefetched”; that is, 
they are input before being needed into the next available 
entry of the eight-byte Instruction Queue. The CPU performs 
two types of Instruction Fetch cycles: Sequential and Non- 
Sequeniial. These can be distinguished from each other by 
their differing status combinations on pins STO—ST3 (Sec- 
tion 3.4.2). 
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A Sequential Fetch will be performed by the CPU whenever 
the Data Bus would otherwise be idie and the Instruction 
Queue is not currently full. Sequential Fetches are always 
Even Word Read cycles (Table 3-1). 


A Non-Sequential Fetch occurs as a result of any break in 
the normally sequential flow of a program. Any jump or 
branch instruction, a trap or an interrupt will cause the next 
Instruction Fetch cycle to be Non-Sequential. In addition, 
certain instructions flush the instruction queue, causing the 
next instruction fetch to display Non-Sequential status. Only 
the first bus cycle after a break displays Non-Sequential 
status, and that cycle is either an Even Word Read or an 
Odd Byte Read, depending on whether the destination ad- 
dress is even or odd. 


3.4.5 Interrupt Control Cycles 


Activating the INT or NMI pin on the CPU will initiate one or 
more bus cycles whose purpose is inerrupt control rather 
than the transfer of instructions or data. Execution of the 
Return from Interrupt instruction (RETI) will also cause Inter- 
rupt Conirol bus cycles. These differ from instruction or data 
transfers only in the status presented on pins STO-STS3. All 


interrupt Control cycles are single-byte Read cycles. 


This section describes only the Interrupt Control sequences 
associated with each interrupt and with the return from its 
service routine. For full details of the NS32016 interrupt 
structure, see Section 3.8. 
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3.0 Functional Description (continued 


TABLE 3-3 
interrupt Sequences 


Cycle Status Address DDIN HBE AQ High Bus 


A. Non-Maskable interrupt Control Sequences. 


Interrupt Acknowledge 
1 0100 FFFFO046 Don’t Care Don’t Care 


Interrupt Return 


None: Performed through Return from Trap (RETT) instruction. 


B. Non-Vectored Interrupt Control Sequences. 


interrupt Acknowledge 
1 0100 FFFEOQO01¢ Don’t Care Don’t Care 


Interrupt Return 
None: Performed through Return from Trap (RETT) instruction. 
C. Vectored Interrupt Sequences: Non-Cascaded. 


Interrupt Acknowledge 
1 0100 FFFEQ0i6 Don't Care Vector: 
Range: 0-127 


Interrupt Return 

1 0110 FFFEOOi, Don’t Care Vector: Same as 
in Previous Int. 
Ack. Cycle 


D. Vectored Interrupt Sequences: Cascaded. 


Interrupt Acknowledge 
1 0100 FFFE00i¢ Don’t Care Cascade Index: 
range —16 to —1 


(The CPU here uses the Cascade Indx to find the Cascade Address.) 
2 0101 Cascade 0 lor Oor Vector, range 0-255; on appropriate 
Address o* 1* half of Data Bus for even/odd address 


interrupt Return 

1 0110 FFFE001, Don’t Care Cascade Index: 
same as in 
previous Int. 
Ack. Cycle 


(The CPU here uses the Cascade Index to find the Cascade Address.) 
2 0111 Cascade 0 1or Oor Don’t Care Don’t Care 
Address o* 1* 


* If the Cascaded ICU Address is Even (AO is low), then the CPU applies HBE high and reads the vector number from bits 0-7 of the Data Bus. 


If the address is Odd (A0 is high), then the CPU applies HBE low and reads the vector number from bits 8-15 of the Data Bus. The vector number may be in the 
range 0-255. 
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3.0 Functional Description (continued) . | 


3.4.6 Slave Processor Communication 


in addition to its use as the Address Translation sirap (Sec- 
tion 3.5.1), the AT/SPC pin is used as the data strobe for AT/SPC 
Slave Processor transfers. In this role, it is referred to as NS32016 

Slave Processor Control (SPC). In a slave processor bus cPu 

cycle, data is transferred on the Data Bus (ADO-AD15), and 
the Status Lines STO-ST3 are monitored by each Slave 
Processor in order to determine the type of transfer being 
performed. SPC is bidirectional, but is driven by the CPU 
during all Slave Processor bus cycles. See Section 3.9 for 


full protocol sequences. TL/EE/S054-23 
FIGURE 3-12. Slave Processor Connections 


AD(0-15) 
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TL/EE/5054-24 
Note: 

(1) CPU samples Data Bus here. 

(2) DBE and ail other NS32201 TCU bus signals remain inactive because no ADS pulse is received from the CPU. 
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FIGURE 3-13. CPU Read from Slave Processor 
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3.0 Functional Description (Continued) 


3.4.6.1 Slave Processor Bus Cycles 


A Slave Processor bus cycle always takes exactly two clock 
cycles, labeled T1 and T4 (see Figures 3-73 and 3-14). 
During a Read cycle, SPC is activated at T1, data is sam- 
pled at T4, and SPC is removed. The Cycle Status pins lead 
the cycle by one clock period, and are sampled at the lead- 
ing edge of SPC. During a Write cycle, the CPU applies data 
and activates SPC at T1, removing SPC at T4. The Slave 
Processor latches status on the leading edge of SPC and 
latches data on the trailing edge. 

Since the CPU does not pulse the Address Strobe (ADS), 
no bus signals are generated by the NS32201 Timing Con- 
trol Unit. The direction of a transfer is determined by the 
sequence (‘protocol’) established by the instruction under 


PREV. CYCLE 
| T4 ORTi 


Note: 


(1) Slave Processor samples data bus here. 


execution; but the CPU indicates the direction on the DDIN 
pin for hardware debugging purposes. 


3.4.6.2 Slave Operand Transfer Sequences 


A Slave Processor operand is transferred in one or more 
Slave bus cycles. A Byte operand is transferred on the 
least-significant byte of the Data Bus (ADO-AD7), and a 
Word operand is transferred on the entire bus. A Double 
Word is transferred in a consecutive pair of bus cycles, 
least-significant word first. A Quad Word is transferred in 
two pairs of Slave cycles, with other bus cycles possibly 
occurring between them. The word order is from least-signif- 
icant word to most-significant. 


NEXT CYCLE 
TIORTI | 


TL/EE/5054—25 


(2) DBE, being provided by the NS32201 TCU, remains inactive due to the fact that no pulse is presented on ADS. 
TCU signals RD, WA and TSO also remain inactive. 


FIGURE 3-14. CPU Write to Slave Processor 
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3.5 MEMORY MANAGEMENT OPTION 


The NS32016 CPU, in conjunction with the NS32082 Mem- 
ory Management Unit (MMU), provides full support for ad- 
dress translation, memory protection, and memory alloca- 
tion techniques up to and including Virtual Memory. 


3.5.1 Address Translation Strap 

The Bus interface Control section of the NS32016 CPU has 
two bus timing modes: With or Without Address Translation. 
The mode of operation is selected by the CPU by sampling 
the AT/SPC (Address Translation/Slave Processor Control) 
pin on the rising edge of the RST (Reset) pulse. If AT/SPC 
is sampled as high, the bus timing is as previously 


| T4 OR Ti | 1 


| tm 
of Pt of | fd 





HBE 





RDY 





described in Section 3.4. If it is sampled as low, two chang- 
eS occur: 


1) An extra clock cycle, Tmmu, is inserted into all bus 
cycles except Slave Processor transfers. 


2) The DS/FLT pin changes in function from a Data 
Sirobe output (DS) to a Float Command input (FLT). 


The NS32082 MMU will itself pull the CPU AT/SPC pin low 
when it is reset. In non-Memory-Managed systems this pin 
should be pulled up to Voc through a 10 kf resistor. 


Note that the Address Translation strap does not specifical- 
ly declare the presence of an NS32082 MMU, but only the 


T2 | T3 | T4 | T1ORTi | 


a [| 


a es ey Se as CS Os OR OO GS OD 


NEXT ADDR 





+ NEXT ADDR 
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FIGURE 3-15. Read Cycle with Address Translation (CPU Action) 
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3.0 Functional Description (Continued 


presence of external address translation circuitry. MMU in- 
structions will still trap as being undefined unless the 
SETCFG (Set Configuration) instruction is executed to de- 
clare the MMU instruction set valid. See Section 2.1.3. 


3.5.2 Translated Bus Timing 


Figures 3-15 and 3-176 illustrate the CPU activity during a 
Read cycle and a Write cycle in Address Translation mode. 
The additional T-State, Tmmu, is inserted between T1 and 
T2. During this time the CPU places ADO-AD15 and A16- 
A23 into the TRI-STATE® mode, allowing the MMU to as- 
sert the translated address and issue the physical address 
strobe PAV. T2 through T4 of the cycle are identical to their 
counter-parts without Address Translation, with the excep- 

















R 


\ 





T4 ORTi | T1 Tmmu 


PLILILILILILI LU 
old 





w---4----- an NEXT AD 


DATA OUT ) NEXT ADDR 


STATUS VALID | 


FIGURE 3-16. Write Cycle with Address Translation (CPU Action) 









tion that the CPU Address lines A16-A23 remain in the 
TRI-STATE condition. This allows the MMU to continue as- 
serting the translated address on those pins. 


Note that in order for the NS32082 MMU to operate correct- 
ly it must be set to the 32016 mode by forcing A24/HBF 
high during reset. 


Figures 3-17 and 3-18 show a Read cycle and a Write cycle 
as generated by the 32016/32082/32201 group. Note that 
with the CPU ADS signal going only to the MMU, and with 
the MMU PAV signal substituting for ADS everywhere else, 
Tmmu through T4 look exactly like T1 through T4 in a non- 
Memory-Managed system. For the connection diagram, see 
Appendix B. 
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NS32016 CPU BUS SIGNALS 
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FIGURE 3-17. Memory-Managed Read Cycle 
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3.0 Functional Description (Continued 


NS32016 CPU BUS SIGNALS 
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FIGURE 3-18. Memory-Managed Write Cycle 
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3.0 Functional Description (continued | 


= 
w” 
Go 
FLT —_ S 
3.5.3 The FLT (Float) Pin 1) Sets ADO-AD15, A16-A23 and DDIN to the TRI- — 
The FLT pin is used by the CPU for address translation STATE condition (“floating”). ri 
support. Activating FLT during Tmmu causes the CPU to 2) Sets HBE low. ~~ 
wait longer than Tmmu for address translation and valida- 3) Suspends further internal processing of the current in- A 
tion. This feature is used occasionally by the NS32082 MMU struction. This ensures that the current instruction re- Go 
in order to update its internal translation look.aside buffer mains abortable with retry. (See RST/ABT description, 8 
(TLB) from page tables in memory, or to update certain Section 3.5.4.) . > 
status bits within them. — oe Note that the ADO-AD15 pins may be briefly asserted dur- @ 
Figure 3-19 shows the effects of FLT. Upon sampling FLT ing the first idle T-State. The above conditions remain in | Z 
low, late in Tmmu, the CPU enters idle T-States (Tf) during effect until FLT again goes high. See the Timing Specifica- | 2 
which it: tions, Section 4. DS 
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FIGURE 3-19. FLT Timing 
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3.0 Functional Description (continued) 


3.5.4 Aborting Bus Cycles 


The RST/ABT pin, apart from its Reset function (Section 
3.3), also serves as the means to “abort,” or cancel, a bus 
cycle and the instruction, if any, which initiated it. An Abort 
request is distinguished from a Reset in that the RST/ABT 
pin is held active for only one clock cycle. 


lf RST/ABT is pulled low during Tmmu or Tf, this signals 
that the cycle must be aborted. The CPU itself will enter T2 
and then Ti, thereby terminating the cycle. Since it is the 
MMU PAV signal which triggers a physical cycle, the rest of 
the system remains unaware that a cycle was started. 


The NS32082 MMU will abort a bus cycle for either of two 
reasons: 


1) The CPU is attempting to access a virtual address 
which is not currently resident in physical memory. The 
reference page must be brought into physical memory 
from mass storage to make it accessible to the CPU. 


2) The CPU is attempting to perform an access which is 
not allowed by the protection level assigned to that 
page. 

When a bus cycle is aborted by the MMU, the instruction 

that caused it to occur is also aborted in such a manner that 

it is guaranteed re-executable later. The information that is 
changed irrecoverably by such a partly-executed instruction 
does not affect its re-execution. 


3.5.4.1 The Abort Interrupt 


Upon aborting an instruction, the CPU immediately performs 
an interrupt through the ABT vector in the Interrupt Table 
(see Section 3.8). The Return Address pushed on the Inter- 
rupt Stack is the address of the aborted instruction, so that 
a Return from Trap (RETT) instruction will automatically re- 
try it. 

The one exception to this sequence occurs if the aborted 
bus cycle was an instruction prefetch. If so, it is not yet 
certain that the aborted prefetched code is to be executed. 
Instead of causing an interrupt, the CPU only aborts the bus 
cycle, and stops prefetching. If the information in the In- 
struction Queue runs out, meaning that the instruction will 
actually be executed, the ABT interrupt will occur, in effect 
aborting the instruction that was being fetched. 


3.5.4.2 Hardware Considerations 

In order to guarantee instruction retry, certain rules must be 

followed in applying an Abort to the CPU. These rules are 

followed by the NS32082 Memory Management Unit. 

1) If FLT has not been applied to the CPU, the Abort 
pulse must occur during or before Tmmu. See the Tim- 
ing Specifications, Figure 4-22. 
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it FLT has been applied to the CPU, the Abort pulse 
must be applied before the T-State in which FLT goes 
inactive. The GPU will not actually respond to the Abort 
command until FLT is removed. See Figure 4-23. 


The Write half of a Read-Modify-Write operand access 
may not be aborted. The CPU guarantees that this will 
never be necessary for Memory Management funtions 
by applying a special RMW status (Status Code 1011) 
during the Read half of the access. When the CPU 
presents RMW status, that cycle must be aborted if it 
would be illegal to write to any of the accessed ad- 
dresses. 


lf RST/ABT is pulsed at any time other than as indicated 
above, it will abort either the instruction currently under exe- 
cution or the next instruction and will act as a very high-pri- 
ority interrupt. However, the program that was running at the 
time is not guaranteed recoverable. 


3.6 BUS ACCESS CONTROL 


The NS32016 CPU has the capability of relinquishing its 
access to the bus upon request from a DMA device or an- 
other CPU. This capability is implemented on the HOLD 
(Hold Request) and HLDA (Hold Acknowledge) pins. By as- 
serting HOLD low, an external device requests access to 
the bus. On receipt of HLDA from the CPU, the device may 
perform bus cycles, as the CPU at this point has set the 
ADO-AD15, A16-A23, ADS, BDIN and HBE pins to the 
TRI-STATE condition. To return control of the bus to the 
CPU, the device sets HOLD inactive, and the CPU acknowl- 
edges return of the bus by setting HLDA inactive. 


How quickly the CPU releases the bus depends on whether 
it is idle on the bus at the time the HOLD request is made, 
as the CPU must always complete the current bus cycle. 
Figure 3-20 shows the timing sequence when the CPU is 
idle. In this case, the CPU grants the bus during the immedi- 
ately following clock cycle. Figure 3-27 shows the sequence 
if the CPU is using the bus at the time that the HOLD re- 
quest is made. If the request is made during or before the 
clock cycle shown (two clock cycles before T4), the CPU 
will release the bus during the clock cycle following T4. If 
the request occurs closer to T4, the CPU may already have 
decided to initiate another bus cycle. In that case it will not 
grant the bus until after the next T4 state. Note that this 
situation will also occur if the CPU is idle on the bus but has 
initiated a bus cycle internally. 

In a Memory-Managed system, the HLDA signal is connect- 
ed in a daisy-chain through the NS32082, so that the MMU 
can release the bus if it is using it. 
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FIGURE 3-20. HOLD Timing, Bus Initially Idie 
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3.0 Functional Description (continued) | 


3.7 INSTRUCTION STATUS 


In addition to the four bits of Bus Cycle status (STO-ST3), 
the NS32016 CPU also presents Instruction Status informa- 
tion on three separate pins. These pins differ from STO- 
ST3 in that they are synchronous to the CPU’s internal in- 
struction execution section rather than to its bus interface 
section. 


| PFS (Program Flow Status) is pulsed low as each instruction 
begins execution. It is intended for debugging purposes, and 
is used that way by the NS32082 Memory Management 
| Unit. 
U/S originates from the U bit of the Processor Status Regis- 
ter, and indicates whether the CPU is currently running in 
User or Supervisor mode. It is sampled by the MMU for 
mapping, protection and debugging purposes. Although it is 
not synchronous to bus cycles, there are guarantees on its 
validity during any given bus cycle. See the Timing Specifi- 
cations, Figure 4-21. 
ILO (Interlocked Operation) is activated during an SBITI (Set 
Bit, Interlocked) or CBIT! (Clear Bit, Interlocked} instruction. 
It is made available to external bus arbitration circuitry in 
order to allow these instructions to implement the sema- 
phore primitive operations for multi-processor communica- 
tion and resource sharing. As with the U/S pin, there are 
guarantees on its validity during the operand accesses per- 
formed by the instructions. See the Timing Specification 
Section, Figure 4-19. 


3.8 NS32016 INTERRUPT STRUCTURE 


INT, on which maskable interrupts may be requested, 
NMI, on which non-maskable interrupts may be request- 
ed, and 

RST/ABT, which may be used to abort a bus cycle and 
any associated instruction. See Seciion 3.5.4. 





J CASCADE ADDRO 


CASCADE TABLE 


CASCADE ADOR 14 


CASCADE ADDR 15 


FIXED INTERRUPTS 





INTERRUPT BASE 
REGISTER 






; INTERRUPTS r 


anprears DISPATCH TABLE 
ISP, 
VECTORED a 6 


FIGURE 3-22. Interrupt Dispatch and Cascade Tabies 


In addition, there is a set of iniernally-generated “traps” 
which cause interrupt service to be performed as a result 
either of exceptional conditions (e.g., attempted division by 
zero) or of specific instructions whose purpose is to cause a 
trap to occur (e.g., the Supervisor Call instruction). 


3.8.1 General Interrupt/Trap Sequence 


Upon receipt of an interrupt or trap request, the CPU goes 

through four major steps: 

1) Adjustment of Registers. 
Depending on the source of the interrupt or trap, the 
CPU may restore and/or adjust the contents of the 
Program Counter (PC), the Processor Status Register 
(PSR) and the currently-selected Stack Pointer (SP). A 
copy of the PSR is made, and the PSR is then set to 
reflect Supervisor Mode and selection of the Interrupt 
Stack. 


2) Saving Processor Status. 


The PSR copy is pushed onto the Interrupt Stack as a 
16-bit quantity. 


3) Vector Acquisition. 


A Vector is either obtained from the Data Bus or is 
supplied by default. 


4) Service Call. 


The Vector is used as an index into the interrupt Dis- 
patch Table, whose base address is taken from the 
CPU Interrupt Base (INTBASE) Register. See Figure 
3-22. A 32-bit External Procedure Descriptor is read 
from the table entry, and an External Procedure Call is 
performed using it. The MOD Register (16 bits) and 
Program Counter (32 bits) are pushed on the Interrupt 
Stack. 
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3.0 Functional Description (Continued 


This process is illustrated in Figure 3-23, from the viewpoint 
of the programmer. 

Full sequences of events in processing interrupts and traps 
may be found as follows: 


RETURN ADDRESS (PUSH) 
STATUS MODULE 
(PUSH) 


INTBASE REGISTER 


INTERRUPT BASE 
VECTOR 


DESCRIPTOR 


16 
OFFSET 


MODULE TABLE ENTRY 





PROGRAM COUNTER 


32 
[ eserves) 





Interrupt on INT or NMI pin: Section 3.8.7.1. 
Abort Interrupt: Section 3.8.7.4. 
Traps (except Trace): Section 3.8.7.2. 
Trace Trap: Section 3.8.7.3. 


32 BITS 


32 BITS 


INTERRUPT 
STACK 
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ENTRY POINT ADDRESS NEW STATIC BASE 


FIGURE 3-23. Interrupt/Trap Service Routine Calling Sequence 


TL/EE/5054-35 





9.310 


3.0 Functional Description (Continued | 


3.8.2 Interrupt/Trap Return 


To return control to an interrupted program, one of two in- 
structions is used. The RETT (Return from Trap) instruction 
(Figure 3-24) restores the PSR, MOD, PC and SB registers 
to their previous contents and, since traps are. often used 
deliberately as a call mechanism for Supervisor Mode pro- 
cedures, it also discards a specified number of bytes from 
the original stack as surplus parameter space. RETT is used 
to return from any trap or interrupt except the Maskable 
Interrupt. For this, the RETI (Return from Interrupt) instruc- 
tion is used, which also informs any external Interrupt Con- 
trol Units that interrupt service has completed. Since inter- 
rupts are generally asynchronous external events, RETI 
does not pop parameters. See Figure 3-25. , 


3.8.3 Maskable Interrupts (The INT Pin) 


The INT pin is a level-sensitive input. A continuous low level 
is allowed for generating multiple interrupt requests. The 


PROGRAM COUNTER 


input is maskable, and is therefore enabled to generate in- 
terrupt requests only while the Processor Status Register | 
bit is set. The | bit is automatically cleared during service of 
an INT, NMI or Abort request, and is restored to its original 
setting upon return from the interrupt service routine via the 
RETT or RETI instruction. 


The INT pin may be configured via the SETCFG instruction 
as either Non-Vectored (CFG Register bit |=0) or Vectored 
(bitI=1). 


3.8.3.1 Non-Vectored Mode 


In the Non-Vectored mode, an interrupt request on the INT 
pin will cause an Interrupt Acknowledge bus cycle, but the 
CPU will ignore any value read from the bus and use instead 
a default vector of zero. This mode is useful for small sys- 
tems in which hardware interrupt prioritization is unneces- 
sary. 
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3.0 Functional Description (Continued) 
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FIGURE 3-25. Return from Interrupt (RET) Instruction Flow 
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3.0 Functional Description (Continued) 


3.8.3.2 Vectored Mode: Non-Cascaded Case 


In the Vectored mode, the CPU uses an Interrupt Control 
Unit (ICU) to prioritize up to 16 interrupt requests. Upon re- 
ceipt of an interrupt request on the INT pin, the CPU per- 
forms an “Interrupt Acknowledge, Master” bus cycle (Sec- 
tion 3.4.2) reading a vector value from the low-order byte of 
the Data Bus. This vector is then used as an index into the 
Dispatch Table in order to find the External Procedure De- 
scriptor for the proper interrupt service procedure. The serv- 
ice procedure eventually returns via the Return from Inter- 
rupt (RETI) instruction, which performs an End of Interrupt 
bus cycle, informing the ICU that it may re-prioritize any in- 
terrupt requests still pending. The ICU provides the vector 
number again, which the CPU uses to determine whether it 
needs also to inform a Cascaded ICU (see below). 


in a system with only one ICU (16 levels of interrupt), the 
vectors provided must be in the range of 0 through 127; that 
is, they must be positive numbers in eight bits. By providing 
a negative vector number, an ICU flags the interrupt source 
as being a Cascaded ICU (see below). 


3.8.3.3 Vectored Mode: Cascaded Case 


In order to allow up to 256 levels of interrupt, provision is 
made both in the CPU and in the NS32202 Interrupt Control 
Unit (ICU) to transparently support cascading. Figure 3-27 
shows a typical cascaded configuration. Note that the Inter- 
rupt output from a Cascaded ICU goes to an Interrupt Re- 
quest input of the Master ICU, which is the only ICU which 
drives the CPU INT pin. 


In a system which uses cascading, two tasks must be per- 

formed upon initialization: 

1) For each Cascaded ICU in the system, the Master ICU 
must be informed of the line number (0 to 15) on which 
it receives the cascaded requests. 


2) A Cascade Table must be established in memory. The 
Cascade Table is located in a NEGATIVE direction 
from the location indicated by the CPU Interrupt Base 
(INTBASE) Register. Its entries are 32-bit addresses, 
pointing to the Vector Registers of each of up to 16 
Cascaded ICUs. 





DATA 


(8) 
CONTROL 









NS32016 


NS32202 


FIGURE 3-26. Interrupt Control Unit Connections (16 Levels) 


Figure 3-22 illustrates the position of the Cascade Table. To 
find the Cascade Table entry for a Cascaded ICU, take its 
Master ICU line number (0 to 15) and subtract 16 from it, 
giving an index in the range —16 to —1. Multiply this value 
by 4, and add the resulting negative number to the contents 
of the INTBASE Register. The 32-bit entry at this address 
must be set to the address of the Hardware Vector Register 
of the Cascaded ICU. This is referred to as the “Cascade 
Address.” 


Upon receipt of an interrupt request from a Cascaded ICU, 
the Master ICU interrupts the CPU and provides the nega- 
tive Cascade Table index instead of a (positive) vector num- 
ber. The CPU, seeing the negative value, uses it as an index 
into the Cascade Table and reads the Cascade Address 
from the referenced entry. Applying this address, the CPU 
performs an “Interrupt Acknowledge, Cascaded”’ bus cycle 
(Section 3.4.2), reading the final vector value. This vector is 
interpreted by the CPU as an unsigned byte, and can there- 
fore be in the range of 0 through 255. 


in returning from a Cascaded interrupt, the service proce- 

dure executes the Return from Interrupt (RETI) instruction, 

as it would for any Maskable Interrupt. The CPU performs 

an “End of Interrupt, Master” bus cycle (Section 3.4.2), 

whereupon the Master ICU again provides the negative 

Cascaded Table index. The CPU, seeing a negative value, 

uses it to find the corresponding Cascade Address from the 

Cascade Table. Applying this address, it performs an “End 

of Interrupt, Cascaded” bus cycle (Section 3.4.2), informing 

the Cascaded ICU of the completion of the service routine. 

The byte read from the Cascaded ICU is discarded. 

Note: if an interrupt must be masked off, the CPU can do so by setting the 
corresponding bit in the interrupt mask register of the interrupt con- 
troller. However, if an interrupt is set pending during the CPU instruc- 
tion that masks off that interrupt, the CPU may still perform an inter- 
tupt acknowledge cycle following that instruction since it might have 
sampled the INT line before the ICU deasserted it. This could cause 
the ICU to provide an invalid vector. To. avoid this problem the above 
operation should be performed with the CPU interrupt disabled. 
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3.0 Functional Description (Continued 
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FIGURE 3-27. Cascaded Interrupt Control Unit Connections 


3.8.4 Non-Maskable interrupt (The NMI Pin) 


The Non-Maskable Interrupt is triggered whenever a falling 
edge is detected on the NMI pin. The CPU performs an 
“Interrupt Acknowledge, Master’ bus cycle (Section 3.4.2) 
when processing of this interrupt actually begins. The Inter- 
rupt Acknowledge cycle differs from that provided for Mask- 
able Interrupts in that the address presented is FFFFO04.. 
The vector value used for the Non-Maskable Interrupt is 
taken as 1, regardless of the value read from the bus. 

The service procedure returns from the Non-Maskable In- 
terrupt using the Return from Trap (RETT) instruction. No 
special bus cycles occur on return. 


For the full sequence of events in processing the Non- 
Maskabie Interrupt, see Section 3.8.7.1. 
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3.8.5 Traps 


A trap is an internally-generated interrupt request caused as 
a direct and immediate result of the execution of an instruc- 
tion. The Return Address pushed by any trap except Trap 
(TRC) below is the address of the first byte of the instruction 
during which the trap occurred. Traps do not disable inter- 
rupts, as they are not associated with external events. Traps 
recognized by NS32016 CPU are: 


Trap (Slave): An exceptional condition was detected by the 
Floating Point Unit or another Slave Processor during the 
execution of a Slave Instruction. This trap is requested via 
the Status Word returned as part of the Slave Processor 
Protocol (Section 3.9.1). 











3.0 Functional Description (continue) 
Trap (ILL): Illegal operation. A privileged operation was at- 
tempted while the CPU was in User Mode (PSR bit U= 1). 
Trap (SVC): The Supervisor Cal! (SVC) instruction was exe- 
cuted. 


Trap (DVZ): An attempt was made to divide an integer by 
zero. (The FPU trap is used for Floating Point division by 
zero.) 
Trap (FLG): The FLAG instruction detected a “1” in the 
CPU PSR F bit. 


Trap (BPT): The Breakpoint (BPT) instruction was execut- 
ed. 

Trap (TRC): The instruction just completed is being traced. 
See below. 

Trap (UND): An undefined opcode was encountered by the 
CPU. 


A special case is the Trace Trap (TRC), which is enabled by 
setting the T bit in the Processor Status Register (PSR). At 
the beginning of each instruction, the T bit is copied into the 
PSR P (Trace “Pending”) bit. If the P bit is set at the end of 
an instruction, then the Trace Trap is activated. lf any other 
trap or interrupt request is made during a traced instruction, 
its entire service procedure is allowed to complete before 
the Trace Trap occurs. Each interrupt and trap sequence 
handles the P bit for proper tracing, guaranteeing one and 
only one Trace Trap per instruction, and guaranteeing that 
the Return Address pushed during a Trace Trap is always 
the address of the next instruction to be traced. 


3.8.6 Prioritization 


The NS32016 CPU internally prioritizes simultaneous inter- 
rupt and trap requests as follows: 


1) Traps other than Trace (Highest priority) 
2) Abort 

3) Non-Maskable Interrupt 

4) Maskable Interrupts 

5) Trace Trap (Lowest priority) 


3.8.7 Interrupt/Trap Sequences: Detail Flow 


For purposes of the following detailed discussion of inter- 
rupt and trap service sequences, a single sequence called 
“Service” is defined in Figure 3-28. Upon detecting any in- 
terrupt request or trap condition, the CPU first performs a 
sequence dependent upon the type of interrupt or trap. This 
sequence will include pushing the Processor Status Regis- 
ter and establishing a Vector and a Return Address. The 
CPU then performs the Service sequence. 


For the sequenced followed in processing either Maskable 
or Non-Maskable Interrupts (on the INT or NMI pins, respec- 
tively), see Section 3.8.7.1. For Abort interrupts, see Section 
3.8.7.4. For the Trace Trap, see Section 3.8.7.3, and for all 
other traps see Section 3.8.7.2. 


3.8.7.1 Maskable/Non-Maskable Interrupt Sequence 


This sequence is performed by the CPU when the NMI pin 
receives a falling edge, or the INT pin becomes active with 
the PSR | bit set. The interrupt sequence begins either at 
the next instruction boundary or, in the case of the String 
instructions, at the next interruptible point during its execu- 
tion. 


—- 


ft a String instruction was interrupted and not yet com- 
pleted: 


a. Clear the Processor Status Register P bit. 
b. Set “Return Address” to the address of the first 
byte of the interrupted instruction. 
Otherwise, set “Return Address” to the address of the 
next instruction. 
2. Copy the Processor Status Register (PSR) into a tem- 
porary register, then clear PSR bits S, U, T, P and I. 


3. If the interrupt is Non-Maskable: 

a. Read a byte from address FFFF004¢, applying 
Status Code 0100 (Interrupt Acknowledge, Mas- 
ter: Section 3.4.2). Discard the byte read. 

b. Set “Vector” to 1. 

c. Go to Step 8. 

4. If the interrupt is Non-Vectored: 


a. Read a byte from address FFFF004,, applying 
Status Code 0100 (Interrupt Acknowledge, Mas- 
ter: Section 3.4.2). Discard the byte read. 


Si 2. pe rN 


b. Set ‘Vector’ to 0. 
c. Go to Step 8. 


5. Here the interrupt is Vectored. Read “Byte” from ad- 
dress FFFE001¢, applying Status Code 0100 (interrupt 
Acknowledge, Master: Section 3.4.2). 

6. lf “Byte” > 0, then set “Vector” to “Byte” and go to 
Step 8. 

7. If “Byte” is in the range —16 through —1, then the 
interrupt source is Cascaded. (More negative values 
are reserved for future use.) Perform the following: 

a. Read the 32-bit Cascade Address from memory. 
The address is calculated as INTBASE + 4* Byte. 

b. Read “Vector,” applying the Cascade Address 
just read and Status Code 0101 (Interrupt Ac- 
knowledge, Cascaded: Section 3.4.2). 

8. Push the PSR copy (from Step 2) onto the Interrupt 
Stack as a 16-bit value. 


9. Perform Service (Vector, Return Address), Figure 3-28. 


Service (Vector, Return Address): 


1) Read the 32-bit External Procedure Descriptor from the Interrupt Dis- 
patch Table: address is Vector*4+ INTBASE Register contents. 


2) +Move the Module field of the Descriptor into the MOD Register. 


3) Read the new Static Base pointer from the memory address contained 
in MOD, placing it into the SB Register. 


4) Read the Program Base pointer from memory address MOD+8, and 
add to it the Offset field from the Descriptor, placing the result in the 
Program Counter. 

5) Flush Queue: Non-sequentially fetch first instruction of Interrupt Rou- 
tine. 

6) Push MOD Register onto the Interrupt Siack as a 16-bit value. (The 
PSR has already been pushed as a 16-bit value.) 

7) Push the Return Address onto the interrupt Stack as a 32-bit quantity. 


FIGURE 3-28. Service Sequence 
Invoked during all interrupt/trap sequences 
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3.0 Functional Description (continued) 


3.8.7.2 Trap Sequence: Traps Other Than Trace 


1) Restore the currently selected Stack Pointer and the 
Processor Status Register to their original values at the 
start of the trapped instruction. 


2) Set “Vector” to the value corresponding to the trap 
type. 
SLAVE: 


ILL: 
Svc: 
DV2: 
FLG: 


Vector=3. 

Vector=4, 

Vector=5. 

Vector=6. 

Vector=7. 
BPT: Vector=8. 
UND: Vector = 10. 

3) Copy the Processor Status Register (PSR) into a tem- 
porary register, then clear PSR bits S, U, P and T. 

4) Push the PSR copy onto the Interrupt Stack as a 16-bit 
value. 

5) Set “Return Address” to the address of the first byte of 
the trapped instruction. 


6) Perform Service (Vector, Return Address), Figure 3-28. 


3.8.7.3 Trace Trap Sequence 
1) Inthe Processor Status Register (PSR), clear the P bit. 


2) Copy the PSR into a temporary register, then clear 
PSR bits S, U and T. 


3) Push the PSR copy onto the Interrupt Stack as a 16-bit 
value. 


4) Set “Vector” to 9. 


5) Set “Return Address” to the address of the next in- 
struction. 


6) Perform Service (Vector, Return Address), Figure 3-28. 


3.8.7.4 Abort Sequence 


1) Restore the currently selected Stack Pointer to its origi- 
nal contents at the beginning of the aborted instruction. 


2) Clear the PSR P bit. 


3) Copy the PSR into a temporary register, then clear 
PSR bits S, U, T and |. 


4) Push the PSR copy onto the Interrupt Stack as a 16-bit 
value. 


5) Set “Vector” to 2. 


6) Set “Return Address” to the address of the first byte of 
the aborted instruction. 


7) Perform Service (Vector, Return Address), Figure 3-28. 


3.9 SLAVE PROCESSOR INSTRUCTIONS 


The NS32016 CPU recognizes three groups of instructions 
as being executable by external Slave Processors: 


Floating Point Instruction Set 
Memory Management Instruction Set 
Custom Instruction Set 
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Each Slave Instruction Set is validated by a bit in the Contig- 
uration Register (Section 2.1.3). Any Slave Instruction which 
does not have its corresponding Configuration Register bit 
set will trap as undefined, without any Slave Processor com- 
munication attempted by the CPU. This allows software sim- 
ulation of a non-existent Slave Processor. 


3.9.1 Slave Processor Protocol 


Slave Processor instructions have a three-byte Basic In- 

struction field, consisting of an ID Byte followed by an Oper- 

ation Word. The ID Byte has three functions: 

1) It identifies the instruction as being a Slave Processor 
instruction. 


2) It specifies which Slave Processor will execute it. 


3) It determines the format of the following Operation 

Word of the instruction. 
Upon receiving a Slave Processor instruction, the CPU initi- 
ates the sequence outlined in Figure 3-29. While applying 
Status Code 1111 (Broadcast ID, Section 3.4.2), the CPU 
transfers the ID Byte on the least-significant half of the Data 
Bus (ADO-AD7). All Slave Processors input this byte and 
decode it. The Slave Processor selected by the ID Byte is 
activated, and from this point the CPU is communicating 
only with it. If any other slave protocol was in progress (e.g., 
an aborted Slave instruction), this transfer cancels it. 
The CPU next sends the Operation Word while applying 
Status Code 1101 (Transfer Slave Operand, Section 3.4.2). 
Upon receiving it, the Slave Processor decodes it, and at 
this point both the CPU and the Slave Processor are aware 
of the number of operands to be transferred and their sizes. 
The Operation Word is swapped on the Data Bus; that is, 
bits 0-7 appear on pins AD8-AD15 and bits 8-15 appear 
on pins ADO-AD7. 
Using the Addressing Mode fields within the Operation 
Word, the CPU starts fetching operands and issuing them to 
the Slave Processor. To do so, it references any Addressing 
Mode extensions which may be appended to the Slave 
Processor instruction. Since the CPU is solely responsible 
for memory accesses, these extensions are not sent to the 
Slave Processor. The Status Code applied is 1101 (Transfer 
Slave Processor Operand, Section 3.4.2). 
Status Combinations: 
Send ID (ID): Code 1111 
Xfer Operand (OP): Code 1101 
Read Status (ST): Code 1110 

Action 

CPU Send ID Byte. 
OP CPU Sends Operation Word. 
OP CPU Sends Required Operands. 
— Slave Starts Execution. CPU Pre-Fetches. 
— Slave Pulses SPC Low. 
ST CPU Reads Status Word. (Trap? Alter Flags?) 
OP CPU Reads Results (If Any). 


FIGURE 3-29. Slave Processor Protocol 


Step Status 
1 ID 





3.0 Functional Description (continued) 


After the CPU has issued the last operand, the Slave Proc- 
essor starts the actual execution of the instruction. Upon 
completion, it will signal the CPU by pulsing SPC low. To 
allow for this, and for the Address Translation strap func- 
tion, AT/SPC is normally held high only by an internal pull- 
up device of approximately 5 kN. 

While the Slave Processor is executing the instruction, the 
CPU is free to prefetch instructions into its queue. If it fills 


the queue before the Slave Processor finishes, the CPU will. 


wait, applying Status Code 0011 (Waiting for Slave, Section 
3.4.2). 


Upon receiving the pulse on SPC, the CPU uses SPC to 
read a Status Word from the Slave Processor, applying 
Status Code 1110 (Read Slave Status, Section 3.4.2). This 
word has the format shown in Figure 3-30. \f the Q bit 
(‘‘Quit’, Bit 0) is set, this indicates that an error was detect- 
ed by the Slave Processor. The CPU will not continue the 
protocol, but will immediately trap through the Slave vector 
in the Interrupt Table. Certain Slave Processor instructions 
cause CPU PSR bits to be loaded from the Status Word. 


The last step in the protocol is for the CPU to read a result, 
if any, and transfer it to the destination. The Read cycles 
from the Slave Processor are performed by the CPU while 
applying Status Code 1101 (Transfer Slave Operand, Sec- 
tion 3.4.2). 


An exception to the protocol above is the LMR (Load Mem- 
ory Management Register) instruction, and a corresponding 


Custom Slave instruction (LCR: Load Custom Register). In 
executing these instructions, the protocol ends after the 
CPU has issued the last operand. The CPU does not wait for 
an acknowledgement from the Slave Processor, and it does 
not read status. 


3.9.2 Floating Point Instructions 


Table 3-4 gives the protocols followed for each Floating 
Point instruction. The instructions are referenced by their 
mnemonics. For the bit encodings of each instruction, see 
Appendix A. 


The Operand class columns give the Access Class for each 
general operand, defining how the addressing modes are 
interpreted (see Series 32000 Instruction Set Reference 
Manual). 


The Operand Issued columns show the sizes of the oper- 
ands issued to the Floating Point Unit by the CPU. “D” indi- 
cates a 32-bit Double Word. “i” indicates that the instruction 
specifies an integer size for the operand (B=Byte, 
W= Word, D= Double Word). “‘f’’ indicates that the instruc- 
tion specifies a Floating Point size for the operand (F=32- 
bit Standard Floating, L=64-bit Long Floating). 


The Returned Value Type and Destination column gives the 
size of any returned value and where the CPU places it. The 
PSR Bits Affected column indicates which PSR bits, if any, 
are updated from the Slave Processor Status Word (Figure 
3-30). 


TABLE 3-4 
Floating Point Instruction Protocols 


Operand 1 
Class 


read.f 
read.f 
read.f 
read.f 


Operand 2 
Class 


rmw.f 
rmw.f 
rmw.f 
rmw.f 


Mnemonic 


ADDI 
SUBf 
MULf 
DIVf 


MOvft 
ABSf 
NEGf 


CMPf 


FLOORfi 
TRUNCTi 
ROUNDfi 


MOVFL 
MOVLF 


MOVif 


LFSR 
SFSR 


Note: 

D = Double Word 

i = integer size (B,W,D) specified in mnemonic. 

c = Custom size (D:32 bits or Q:64 bits) specified in mnemonic. 
* = Privileged Instruction: will trap if CPU is in User Mode. 

N/A = Not Applicable to this instruction. 


write.f 
write.f 
write.f 


read.f 
read.f 
read.f 


read.f read.f 


write.i 
write.i 
write.i 


read.f 
read.f 
read.f 


write.L 
write.F 


read.F 
read.L 


write.f 


N/A 
write.D 


read.i 


read.D 
N/A 


Operand 1 
Issued 


> 


fC Thom; mn eR mn mh ooh ot 


2-317 


PSR Bits 
Affected 


none 
none 
none 
none 


Returned Value 
Type and Dest. 
f to Op. 2 
f to Op. 2 
f to Op. 2 
f to Op. 2 


f to Op. 2 
f to Op. 2 
f to Op. 2 


N/A 
ito Op. 2 
ito Op. 2 
ito Op. 2 
L to Op. 2 
F to Op. 2 
fto Op. 2 


N/A 
D to Op. 2 


Operand 2 
Issued 


none 
none 
none 
N,Z,L 
none 
none 
none 


none 
none 


none 


none 
none 
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| 3.0 Functional Description (continued) | 


3.9.3 Memory Management Instructions 










15 87 0 Table 3-5 gives the protocols for Memory Management in- 
structions. Encodings for these instructions may be found in 
Appendix A. 
fA_—* 


New PSR Bit Value(s) 


In executing the RDVAL and WRVAL instructions, the CPU 
“Quit”: Terminate Protocol, Trap(FPU). 


calculates and issues the 32-bit Effective Address of the 
TL/EE/5054-41 single operand. The CPU then performs a single-byte Read 
FIGURE 3-30. Slave Processor Status Word Format cycle from that address, allowing the MMU to safely abort 
the instruction if the necessary information is not currently in 
physical memory. Upon seeing the memory cycle complete, 
the MMU continues the protocol, and returns the validation 
result in the F bit of the Slave Status Word. 


The size of a Memory Management operand is always a 32- 
bit Double Word. For further details of the Memory Manage- 


: ° ; ment Instruction set, see the Series 32000 Instruction S 
Floating Point Unit and are therefore available without CPU Reference Manual and the NS3208> MMU Does Shon et 


Any operand indicated as being of type “f” will not cause a 
assistance. 
| 


transfer if the Register addressing mode is specified. This is 
because the Floating Point Registers are physically on the 


TABLE 3-5. 
Memory Management Instruction Protocols 
Operand 1 Operand 2 Operand 1 Operand 2 Returned Value PSR Bits 


Mnemonic Class Class Issued Issued Type and Dest. Affected 

RDVAL* addr N/A D N/A N/A F 

WRVAL* addr N/A D N/A N/A F 

LMR* read.D N/A D N/A N/A none 

SMR* write.D N/A N/A N/A D to Op. 1 none 
Note: 


in the RDVAL and WRVAL instructions, the CPU issues the address as a Double Word, and performs a single-byte Read cycle from that memory address. For 
details, see the Series 32000 Instruction Set Reference Manual and the NS32082 Memory Management Unit Data Sheet. 


D = Double Word 
* = Privileged Instruction: will trap if CPU is in User Mode. 
N/A = Not Applicable to this instruction. 


9.218 


3.0 Functional Description (continued) | 


3.9.4 Custom Slave Instructions 


Provided in the NS32016 is the capability of communicating 
with a user-defined, “Custom” Slave Processor. The in- 
struction set provided for a Custom Slave Processor defines 
the instruction formats, the operand classes and the com- 
munication protocol. Left to the user are the interpretations 
of the Op Code fields, the programming model of the Cus- 
tom Slave and the actual types of data transferred. The pro- 
tocol specifies only the size of an operand, not its data type. 


Table 3-6 lists the relevant information for the Custom Slave 
instruction set. The designation “‘c” is used to represent an 


operand which can be a 32-bit (“D”) or 64-bit (““Q”) quantity 
in any format; the size is determined by the suffix on the 
mnemonic. Similarly, an “i” indicates an integer size (Byte, 
Word, Double Word) selected by the corresponding mne- 
monic suffix. 

Any operand indicated as being of type ‘c’ will not cause a 
transfer if the register addressing mode is specified. It is 
assumed in this case that the slave processor is already 
holding the operand internally. 


For the instruction encodings, see Appendix A. 


TABLE 3-6. 
Custom Slave Instruction Protocols 
Operand 1 Operand 2 Operand 1 Operand 2 Returned Value PSR Bits 

Mnemonic Class Class Issued Issued Type and Dest. Affected 
CCALOc read.c rmw.c c c c to Op. 2 none 
CCALic read.c rmw.c c c c to Op. 2 none 
CCAL2c read.c rmw.c c c c to On. 2 none 

| CCAL3c read.c rmw.c c c cto Op. 2 none 

| CMOV0c read.c write.c c N/A c to Op. 2 none 

| CMOVic read.c write.c Cc N/A c to Op. 2 none 
CMOV2c read.c write.c Cc N/A c to Op. 2 none 
CMOV3c read.c write.c c N/A c to Op. 2 none 
CCMPc read.c read.c c c N/A N,Z,L 
CCMPitc read.c read.c c c N/A N,Z,L 
CCVOci read.c write.i c N/A ito Op. 2 none 
CCV Ici read.c write.i Cc N/A ito Op. 2 none 
CCV2ci read.c write.i Cc N/A ito Op. 2 none 
CCV3ic readi write.c i N/A c to Op. 2 none 
CCV4DQ read.D write.Q D N/A Q to Op. 2 none 
CCV5QD read.Q write.D Q N/A D to Op. 2 none 
LCSR read.D N/A D N/A N/A none 
SCSR N/A write.D N/A N/A D to Op. 2 none 
CATSTO* addr N/A D N/A N/A F 
CATSTi* addr N/A D N/A N/A F 


N/A 
N/A 


LCR* 
SCR* 


read.D 
write.D 


NOTE: 

D = Double Word 

i = integer size (B,W,D) specified in mnemonic. 

f = Floating Point type (F,L) specified in mnemonic. 
N/A = Not Applicable to this instruction. 
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4.0 Device Specifications 


4.1 PIN DESCRIPTIONS 

The following is a brief description of all NS32016 pins. The 
descriptions reference portions of the Functional Descrip- 
tion, Section 3. 

4.1.1 Supplies 

Power (Vcc): + 5V positive supply. Section 3.1 

Logic Ground (GNDL): Ground reference for on-chip logic. 
Section 3-1. 

Buffer Ground (GNDB): Ground reference for on-chip driv- 
ers connected to output pins. Section 3.1. 


Back-Bias Generator (BBG): Output of on-chip substrate 
voltage generator. Section 3.1. 


4.1.2 Input Signals 


Clocks (PHI1, PHI2): Two-phase clocking signals. Section 
3.2. 

Ready (RDY): Active high. While RDY is inactive, the CPU 
extends the current bus cycle to provide for a slower memo- 
ry or peripheral reference. Upon detecting RDY active, the 
CPU terminates the bus cycle. Section 3.4.1. 


Hold Request (HOLD): Active low. Causes the CPU to re- 
lease the bus for DMA or multiprocessing purposes. Section 
3.6. 


Note: If the HOLD signal is generated asynchronously, it’s set up and hold 
times may be violated. 


In this case it is recommended to synchronize it with CTTL to mini- 
mize the possibility of metastable states. 


The CPU provides only one synchronization stage to minimize the 
HLDA latency. This is to avoid speed degradations in cases of heavy 
HOLD activity (.e. DMA controller cycles interleaved with CPU 
cycles.) 
Interrupt (INT): Active low, Maskable interrupt request. 
Section 3.8. 
Non-Maskable Interrupt (NMI): Active low, Non-Maskable 
interrupt request. Section 3.8. 


Reset/Abort (RST/ABT): Active low. If held active for one 
clock cycle and released, this pin causes an Abort Com- 
mand, Section 3.5.4. lf held longer, it initiates a Reset, Sec- 
tion 3.3. 


4.1.3 Output Signals 


Address Bits 16-23 (A16-A23): These are the most sig- 
nificant 8 bits of the memory address bus. Section 3.4. 


Address Strobe (ADS): Address low. Controls address 
latches; indicates start of a bus cycle. Section 3.4. 


Data Direction In (DDIN): Active low. Status signal indicat- 
ing direction of data transfer during a bus cycle. Section 3.4. 


High Byte Enable (HBE): Active low. Status signal enabling 
transfer on the most significant byte of the Data Bus. Sec- 
tion 3.4; Section 3.4.3. 
Note: The HBE signal is normally floated when the CPU grants the bus in 
response to a DMA request on the HOLD pin. 
However, when an MMU is used and the bus is granted during an 
MMU page table look-up, HBE is not floated since the CPU does not 
have sufficient information to synchronize the release of HBE to the 
MMU’s bus cycles. 
Therefore, in a memory managed system, an external TRI-STATE 
buffer is required. This is shown in Figure B-1 in Appendix B. 
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Status (STO-ST3): Active high. Bus cycle status code, STO 
least significant. Section 3.4.2. Encodings are: 


0000 — Idle: CPU Inactive on Bus. 

0001 — Idle: WAIT Instruction. 

0010 — (Reserved) 

0011 — Idle: Waiting for Slave. 

0100 — Interrupt Acknowledge, Master. 
0101 — Interrupt Acknowledge, Cascaded. 
0110 — End of Interrupt, Master. 

0111 — End of Interrupt, Cascaded. 

1000 — Sequential Instruction Fetch. 

1001 — Non-Sequential Instruction Fetch. 
1010 — Data Transfer. 

1011 — Read Read-Modify-Write Operand. 
1100 — Read for Effective Address. 

1101 — Transfer Slave Operand. 

1110 — Read Slave Status Word. 

1111 — Broadcast Slave ID. 


Hold Acknowledge (HLDA): Active low. Applied by the 
CPU isn response to HOLD input, indicating that the bus has 
been released for DMA or multiprocessing purposes. Sec- 
tion 3.7. 

User/Superior (U/S): User or Supervisor Mode status. 
Section 3.7. High state indicates User Mode, low indicates 
Supervisor Mode. Section 3.7. 


Interlocked Operation (ILO): Active low. Indicates that an 
interlocked instruction is being executed. Section 3.7, 


Program Flow Status (PFS): Active low. Pulse indicates 
beginning of an instruction execution. Section 3.7. 


4.1.4 Input/Output Signals 


Address/Data 0-15 (ADO-AD15): Multiplexed Address/ 
Data information. Bit 0 is the least significant bit of each. 
Section 3.4. 


Address’ Translation/Slave Processor Control 
(AT/SPC): Active low. Used by the CPU as the data strobe 
output for Slave Processor transfers; used by Slave Proces- 
sors to acknowledge completion of a slave instruction. Sec- 
tion 3.4.6. Section 3.9. Sampled on the rising edge of Reset 
as Address Translation Strap. Section 3.5.1. 

In Non-Memory-Managed systems this pin should be pulled- 
up to Vcc through a 10 kQ. resistor. 

Data Strobe/Float (DS/FLT): Active low. Data Strobe out- 
put, Section 3.4, or Float Command input, Section 3.5.3. Pin 
function is selected on AT/SPC pin, Section 3.5.1. 











4.0 Device Specifications (Continued) 
4.2 ABSOLUTE MAXIMUM RATINGS 



















Specifications for Military/Aerospace products are not All Input or Output Voltages With 

contained in this datasheet. Refer to the associated Respect to GND —0.5V to +7V 
reliability electrical test specifications document. ; Power Dissipation | 1.5 Watt 
Temperature Under Bias ° Cto + 70°C Note: Absolute maximum ratings indicate limits beyond 
Storage Temperature —65°C to + 150°C which permanent damage may occur. Continuous operation 


at these limits is not intended; operation should be limited to 
those conditions specified under Electrical Characteristics. 


4.3 ELECTRICAL CHARACTERISTICS Ta = 0 to + 70°C, Vcc = 5V 15%, GND = OV 


| Parameter | Conditions =| Min. | Typ | Max 
| togicaltinputvottage | fT oot 
-Lesenioretvetnge _{_____|_-os_{ | os _ 
| Logical 1 Clock Votage | PHIT,PHI2pinsony | Voo-0.85 | | Voc +05 | 
Togal0 GockVolage | PH. PHlapins ony ——~| os | | os | 


Logical 0 Clock Voltage, 


Transient fringing ioleranc: 












PHI1, PH!2 pins only 
isient (ringing tc! 


Logical 1 Output Voltage lon= —400 pA va 
Logical 0 Output Voitage loL=2 mA Po 
AT/SPC Input Current (low) Vin=0.4V, AT/SPC in input mode | 005 | 


Input Leakage Current 0<Vin<Vcc, All inputs except | 920 
PHI1, PHI2, AT/SPC | 


IO(OFF) | Output Leakage Current 
| 


















Output Pins in 
TRI-STATE condition 


Active Supply Current 





0.4<Vout<Vcc | —20 i | 30 pA 
i 



















lout = 0, Ta= 25°C 





Connection Diagram 


Dual-In-Line Package 











vec 
A23 
iNT 
NMI 
iLO 
STO 
sT1 
$T2 
ST3 
PFS 
DDIN 
ADS 
us Order Number NS32016D or NS32016N 
AT/SPC See NS Package Number D48A or N48A 
RST/ABT 

DS/FLT 

HBE 
HULDA 
HOLD 
BBG 
ROY 
PHI2 
PHIt 
GNOB 
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12 NS32016 37 
cpu 

















TL/EE/5054-2 
Top View 


FIGURE 4-1 
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4.0 Device Specifications (continued) 
4.4 SWITCHING CHARACTERISTICS 


4.4.1 Definitions ed in Figures 4-2 and 4-3, unless specifically stated other- 


All the timing specifications given in this section refer to wise. 
2.0V on the rising or falling edges of the clock phases PHI1 ABBREVIATIONS: 


and PHI2 and 0.8V or 2.0V on ail other signals as illustrat- LE. — leading edge R.E. — rising edge 


T.E. — trailing edge F.E. — falling edge 


tsiGtI 


tsiGan 


TL/EE/5054—42 
FIGURE 4-2. Timing Specification Standard 
(Signal Valid After Clock Edge) TL/EE/5054-43 
FIGURE 4-3. Timing Specification Standard 
(Signal Valid Betore Clock Edge) 
4.4.2 Timing Tables 


4.4.2.1 Output Signals: Internal Propagation Delays, NS32016-6, NS32016-8 and NS32016-10 
Maximum times assume capacitive loading of 100 pF. 


— el eee enn FSS ee 


taLy Address bits 0-15 valid | after R.E., PHI T1 _ 65 55 | | 40 | 


tALh a Address bits 0-15 hold _| after R.E., PHI1 Tmmu or T2 5 


| 4-4 | Data valid (write cycle) after R.E., PHI1 T2 70 
| 4-4 | Datahold (write cycle) _| after R.E., PHI next T1 or Ti 
taHy Address bits 16-23 valid _| after R.E., PHI1 T1 
tan | 4-4. | Address bits 16-23 hold _| after R.E., PHI1 nextT1 or Ti| 0 0 
tALADSs | 45 Address bits 0-15 setup _| before ADS T.E. 25 25 
4. 


tAHADSs 45 [Adiees bis 16-28sctup [bsinemDsTe fas | | as | | as {| | 
taapsn| 49 [Addressbits0-1Shold |atterADSTe. | 20 | | 20 | | is | 
tarapsn| 49 [Addressbits 16-23hoid atterADSTe. | 20 | | 20 | | 5 | 


tat 4-5 | Address bits 0-15 floating | after R.E., PHI1 T2 
(no MMU) 

taLMt 4-9 | Address bits 0-15 floating | after R.E., PHI1 Tmmu 
(with MMU) 

taHMf Address bits 16-23 floating | after R.E., PHI1 Tmmu 
(with MMU) 


typen | 4-4 _| ABE signal hold afterR.E.,PHItnextTiorTi| o | =| o | 


tsty 4-4 | Status (STO-ST3) valid after R.E., PHI T4 
(before T1, see note) 


ts | 44 |Status(STO-ST3)hold__[afterRE,PHINTA(afterT)| o | 
tooiN ee 
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4.0 Device Specifications (Continued 
4.4.2.1 Output Signals: Internal Propagation Delays, NS32016-6, NS32016-8 and NS32016-10 (Continued) 


Name Description Reference/Conditions |_Nss2016-6 | Ns32016-8 | NS32016-10 | Units 
ee 


cone [fet fel [we 
= Se r 
Foe | +4 [ADS sorainacive later, pueT! | ve | eo | | a6 | 18 | 4 | ns 
fiosw [+4 [ADS pusewan —|atorvbsineasey [co | [= | [| | 
‘sae [eteoweegey ener Te | Te 


taLt 4-6 |ADO-AD15 floating (caused by [after R.E., PHI1 T1 
HOLD) 
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taHt 4-6 |A16-A23 floating (caused by after R.E., PHI1 T1 100 | 65 | 
HOLD) 
toss | 4-6 |DS floating (caused by HOLD) _ [after R.E., PHI Ti | 100 | 80 - ns 





tans: | 4-6 |ADS floating (caused by HOLD) |after R.E., PHI1 Ti | 100 | i go | 55 | ns 
HBE floating (caused by HOLD) |after R.E., PHI1 Ti | ! 
DDIN floating (caused by HOLD) | after R.E., PHI1 Ti 
| tu_DAa| 4-6 |HLDA signal active (low) lafter R.E., PHIM Ti | | | 75 | ns 
tuLDAia; 4-8 |HLDA signal inactive after R.E., PHI Ti 100 | | 90 | 75 ns 


tpsr | 48 DS signal returns from floating | after R.E., PHI1 Ti 
i (caused by HOLD) ' | 




























































tansr _ 4-8 | ADS signal returns from floating | after R.E., PHI1 Ti ; 100 | ; 80 | 55 | ns 
(caused by HOLD) | | | | 
| tyeer | 4-8 | HBE signal returns from floating | after R.E., PHI Ti | 100 | | 80 | 55 | ns 
| | | (caused by HOLD) | | | | | | 
tpDINr 4-8 |DDIN signal returns from floating | after R.E., PHI1 Ti | 100 80 | 55 | ns 
(caused by HOLD) 





topint 4-9 |DDIN signal floating (caused by | after FLT F.E. tet ie; tele 
FLT) 


tHBEI HBE signal low (caused by FLT) | after FLT F.E. | 100} | a5 | 


tppinr |. 4-10 |DDIN signal returns from floating | after FLT F.E. 75 65 ns 
| (caused by FLT) 


tuper | 4-10 |HBE signal returns from LOW _|after FLT F.E. 90 85 75 | ns 
(caused by FLT) 

ispca_| 4-13 |SPC output active (low) after R.E., PHI T1 50 45 | | 35 | ns 

tspcia , 4-13 SPC output inactive | after R.E., PHI1 T4 | 50 45 | ; 35 | ns 


after R.E., PHI2 T4 
| Data valid (slave processor write) | after R.E., PHI1 T1 
toh 4-13 (Data hold (slave processor write) | after R.E., PHI1 next Ti or Ti} =O | 0 | | Oo | i ns 
tpesw | 4-18 | PFS pulse width at 0.8V (both edges) 70 70 70 / ons 


25 





|SPC output nonforcing 





tspcnt | 



















































| PFS pulse active (low) after R.E., PHI2 

tprsia | | PFS pulse inactive after R.E., PHI2 
tiLos | 4-20a ILO signal setup before R.E., PHI1 T1 of first 30 | 30 30 | ns 

| | interlocked write cycle | | 
Ge—o ee 
tion | 4-20b ILO signal hold after R.E., PHI1 T3 of last 10 10 10 ns 

| ' interlocked read cycle | 
tiLoa | 4-21 ILO signal active (low) after R.E., PHI 70 65 ; 55 | ns 
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4.0 Device Specifications (Continued) 
4.4.2.1 Output Signals: Internal Propagation Delays, NS32016-6, NS32016-8 and NS32016-10 (Continued) 


_Nss2016-6 | Ns32016-8 | NS32016-10 | 

Description Reference/Conditions 

pee — Eos ra we 
45 | | 35 | 


twoa | 421 [TCOsignalinactve fafterE.PHM || 85 | 
[422 [u/Ssignalvaid _—atterre.pHiita || 55 | | 45 || 85 
+22 |uSsqrainod__foternenerints | wo} {ro} fej 


4-19b | Nonsequential fetch to next PFS | after R.E., PHI1 T1 4 
clock cycle 
tprns | 4-19a | PFS clock cycle to next non- before R.E., PHI1 T1 
sequential fetch 
4-29 |Last operand transfer of an instruc- | before R.E., PHI1 T1 of first 
tion to next PFS clock cycle bus cycle of transfer 


Note: Every memory cycle starts with T4, during which Cycle Status is applied. If the CPU was idling, the sequence will be: “. . . Ti, 74,71. . .”. If the CPU was 
not idling, the sequence will be: “. . . 14,71. . .”. 


4.4.2.2 Input Signal Requirements: NS32016-6, NS32016-8 and NS32016-10 


|_Nsa2016-6 | Ns32016-8 | NS32016-10 | 
Name Description Reference/Conditions Units 
a rae a 


run | 425 [PonersabiowASTRE letevoomaneay || [so] [| | 
cx [45 [pennmototons fsweresrazrs [| [m [ Ts 


tHLDa 4-6 HOLD active (low) setup time | before F.E., PHI2 TX1 
(see note) 


tHLDia 4-8 HOLD inactive setup time before F.E., PHI2 Ti fos | | 


tutph | 4-6 | HOLD hold time after R.E., PHI1 TX2 


Sot 
tFLTa 4-9 FLT active (low) setup time before F.E., PHi2 Tmmu ee ee 
teLTia | 4-11. | FLT inactive setup time before F.E., PHI2 T2 | 25 | | a5 | | as 

trpys | 4-11, 4-12| RDY setup time beforeF.E.,PHI2T20r73] 25 | | 2 | [6] | 
tapyh | 4-11, 4-12 | RDY hold time after F.E., PHI1 T3 fo] [of j|o}] | 
taBTs 4-23 ABT setup time (FLT inactive) lbeforeF.E,PHI2Tmmu | 30 | | 2] | 20) | 
taBTs 4-24 T setup time (FLT active) lbeforeF.E,PHI2T; | 30 | | 2] | 2) | 

[0 


taBTh 4-23 T hold time after R.E., PHI1 
trsTs | 4-25, 4-26 | RST setup time before F.E., PHI1 
tastw 4-26 RST pulse width at 0.8V (both edges) 


tNTs 4-27 _| INT setup time before F.E., PHI1 


“ NMI . 
on [414 [ows soup oaveroodg [tower e.ruzry | 0 [= | T~ | 
im [#14 [Catal avereadeyte) [aferne. Punta 10 | [| |] 
= Pa |aneeatornae (owneimen eo [ To {Taf 


tspcw 4-15 |SPC G pulse width from slave at 0.8V ‘both edges) 
__ | processor (async.input) 

















ABT 
ABT 
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4.0 Device Specifications (Continued) 
4.4.2.2 Input Signal Requirements: NS32016-6, NS32016-8 and NS32016-10 (Continued) 


Powe] Desrpton Reference/Conditions eerie NS32016-8 serie 


4-16 | AT/SPC hold for address | after F.E., PHI1 of cycle during 
translation strap which RST pulse is removed 
4-16 | AT/SPC setup for address | before R.E., PHI1 of cycle during 
translation strap which RST pulse is removed 
Note: This setup time is necessary to ensure prompt acknowledgement via HLDA and the ensuing floating of CPU off the buses. Note that the time from the receipt 


of the HOLD signal until the CPU floats is a function of the time HOLD signal goes low, the state of the RDY input (in MMU systems), and the length of the current 
MMU cycle. 





















Units 











4.2.3 Clocking Requirements: NS32016-6, NS32016-8 and NS32016-10 











Reference/ 


~ co 
ee i 


4-17 |PHI1, PHI2 rise time 0.8V to Voc — 0.9V 
on R.E., PHI1, PHi2 


4-17 |PHI1, PHI2 fali time Voc — 0.9V to 0.8V 
on F.E., PHI1, PHI2 


4-17 |Clock period R.E., PHi1, PHI2 to next |s000 
R.E., PHI, PHI2 
4-17 |PHIH, PHI2 At 2.0V on PHH, PHI2 
47 width (both edges) 0.5 tcp 14| 0.8 tcp | , 


4-17 !PHI1, PHI2 high time !atVoc — 0.9V on 


PHI, PHI2 (both edges) 


0.8V on F.E., PHI1, PHI2 to 
0.8V on R.E., PHI2, PHI1 


|Non-overlap asymmetry|At 0.8V on PHI1, PHI2 
(thovi(1) — thovi(2y) 


| PHI1, PHI2 asymmetry |At 2.0V on PHI1, PHI2 an 4 
(tcLw1) — tcLw2)) 


Name 








ns 










toLwi1,2) 





totn1,2) 





ns 













thov_(1,2) Non-overlap time 


| ns 






thovLas 








tcLwas 





ns 





9.8395 


01-91 0Z€SN/8-910ZESN/9-9102ESN 





NS32016-6/NS32016-8/NS320 16-10 


4.0 Device Specifications (Continued) 


4.4.3 Timing Diagrams 


| 11 








T2 | 








(HIGH) 
i 





FIGURE 4-4. Write Cycle 





VALID 








(HIGH) 





FIGURE 4-5. Read Cycle 
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4.0 Device Specifications (Continued) 


HEE 
BDIN 


ADO-15 [ 


A16-23 [ 

















arene en 





(FLOATING) 


FIGURE 4-6. Floating by HOLD Timing (CPU Not Idle Initially) 


TL/EE/5054-46 


Note that whenever the CPU is not idling (not in Ti), the HOLD request (HOLD low) must be active typ, before the falling edge of PHI2 of the clock cycle that 
appears two clock cycles before T4 (TX1) and stay low until ty_pp after the rising edge of PHI1 of the clock cycle that precedes T4 (TX2) for the request to be 


acknowledged. 


Tit 
PHI1 









Ti2 





























TL/EE/5054-47 
FIGURE 4-7. Floating by HOLD Timing (CPU Initially Idle) 
Note that during Ti1 the CPU is already idling. 
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Ti | T4 
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HBE | ~ |" (FLOATING) 
DDIN 


A16-23 | —/—_§ —-$_ —~—— 








ADO-15 














(HIGH) 





(FLOATING) 





FIGURE 4-8. Release from HOLD 
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4.0 Device Specifications (continued 


CPU STATES | "1 | TMMU tT Tt 


MMU STATES 1 OI TMMU 14 11 


pat | | | | | | | | | 


FUT [ taumt 
im tFLTa 
ADO-15 [ ADDRESS (CPU))- — —-+——— ————~ + 
(CPU) ; (FLOATING, DRIVEN BY MMU) 
A16-23 
(CPU) (FLOATING, DRIVEN BY MMU) 
ADS 
(CPU) 
oH aie 
(CPU) (FLOATING, DRIVEN BY MMU) 
topINt 


es 
wel 














tuBet (FORCING LOW) 1 
TL/EE/5054-49 
FIGURE 4-9. FLT Initiated Cycle Timing 


MMU STATES 


a 


CPU STATES Tt | T2 | 13 | T4 


— 


(FLOATING, DRIVEN BY MMU) 








(FORCING LOW) 
TL/EE/5054-50 


Note that when FLT is deasserted the CPU restarts driving DDIN before the MMU releases it. This, however, does not cause any conflict, since both CPU and MMU 
force DDIN to the same logic level. 


FIGURE 4-10. Release from FLT Timing 





TL/EE/5054-51 
FIGURE 4-11. Ready Sampling (CPU Initially READY) 
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4.0 Device Specifications (Continued) 





| TL/EE/5054-52 
| FIGURE 4-12. Ready Sampling (CPU Initially NOT READY) 
| 





STATUS VALID 








STO-3 [ 
crea 7 STATUS VALID erase 


40S [ (HIGH) 











re a 


TL/EE/5054-53 








NEXT STATUS 


TL/EE/5054—-54 


FIGURE 4-13. Slave Processor Write Timing FIGURE 4-14. Slave Processor Read Timing 


SPC 
(FROM SLAVE) 





FIGURE 4-15. SPC Timing 


After transferring last operand to a Slave Processor, CPU turns 
OFF driver and holds SPC high with internal 5 k®. pullup. 


PHIt [ 
PHI2 [ 
(FROM ri [ 





tats 


FIGURE 4-16. Reset Configuration Timing 
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4.0 Device Specifications (Continued) 


touw(1) 
+— ter n(1) —| 


















thovi(1) thovi(2) 
TL/EE/5054-57 
FIGURE 4-17. Clock Waveforms 


__ TL/EE/5054-58 
FIGURE 4-18. Relationship of PFS to Clock Cycles 


lon | 
“(FLOSS 


a 


tPENS 


ST0-3 X CODE 1001 


FIGURE 4-19a. Guaranteed Delay, PFS to Non-Sequential Fetch 





TL/EE/5054-59 





3 
mv 
a 





__ TL/EE/5054-60 
FIGURE 4-19b. Guaranteed Delay, Non-Sequential Fetch to PFS 
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4.0 Device Specifications (Continued) | 


| T3 OR Ti | T4 OR Ti | v1 | T2 | T3 | T4 | 
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—_ TL/EE/5054-61 
FIGURE 4-20a. Relationship of ILO to First Operand Cycle 
of an Interlocked Instruction 


T3 OR Ti T4ORTi 




















__ TL/EE/5054-62 
FIGURE 4-20b. Relationship of ILO to Last Operand Cycle 
of an interlocked Instruction 


“(SLIPS 
tof 





tiLoa ttLoia 


FIGURE 4-21. Relationship of ILO to Any Clock Cycle 
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| T30ATI | T4 ORTi 








TL/EE/5054-64 


FIGURE 4-22. U/S Relationship to Any Bus Cycle— 
Guaranteed Valid interval 
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4.0 Device Specifications (Continued 
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PHIi 


DS/FLT 





RST/ABT 





| 
“| 
| 
| 
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FIGURE 4-24. Abort Timing, FLT Applied 


PHI1 


rari | 





4 


FIGURE 4-25. Power-On Reset 


TL/EE/5054-67 





TL/EE/5054~68 
FIGURE 4-26. Non-Power-On Reset 
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4.0 Device Specifications (Continued) 


bof tNTs 


INT [ | _ TL/EE/5054~70 
FIGURE 4-28. NMI Interrupt Signal Timing 


_ TL/EE/5084-69 
FIGURE 4-27. INT Interrupt Signal Detection 


FIRST BUS CYCLE NEXT 


| 
| | 
ALLL 


af 


nh ———— tLXPF | 


nn 


FIGURE 4-29. Relationship Between Last Data Transfer of 
an Instruction and PFS Pulse of Next Instruction 
NOTE: 


In a transter of a Read-Modify-Write type operand, ihis is the Read transfer, 
displaying RMW Status (Code 1011). 
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Options: in String Instructions 


pu | 6 | 7 | 


Appendix A: Instruction Formats 


NOTATIONS: 
T = Translated 


i= Integer Type Field 


B = 00 (Byte) 

W = 01 (Word) 

D = 11 (Double Word) 
Floating Point Type Field 


B = Backward 

U/W = 00: None 
01: While Match 
11: Until Match 


NS32016-6/NS32016-8/NS32016-10 


F = 1 (Std. Floating: 32 bits) 
L= 0 (Long Floating: 64 bits) 
Custom Type Field 

D = 1 (Double Word) 


Configuration bits, in SETCFG: 


Q = 0 (Quad Word) mreg: MMU Register number, in LMR, SMR. 
Operation Code 0000 = BPRO 


Valid encodings shown with each format. 0001 = BPR1 
gen, gen 1, gen 2= General Addressing Mode Field 0010 = (Reserved) 
See Sec. 2.2 for encodings. 0011 = (Reserved) 
reg = General Purpose Register Number 0100 = PFO 
cond = Condition Code Field 0101 = PF1 
0000 = EQual: Z = 1 0110 (Reserved) 
0001 = Not Equal: Z = 0 0111 = (Reserved) 
0010 = Carry Set: C = 1 1000 = SC 
0011 = Carry Clear: C = 0 1001 = (Reserved) 
0100 = Higher: L = 1 1010 = MSR 
0101 = Lower or Same: L = 0 1011 = BCNT 
0110 = Greater Than: N = 1 1100 = PTBO 
0111 = Less or Equal: N = 0 4101 = PTB1 
1000 = Flag Set: F = 1 1110 = (Reserved) 
1001 = Flag Clear: F = 0 1111 = EIA 
1010 = LOwer: L = OandZ = 0 
1011 = Higher or Same: L = 1 orZ = 1 
1100 = Less Than: N = O and Z = 0 
1101 = Greater or Equal: N = 10rZ = 1 Format 0 
1110 = (Unconditionally True) 
1111 = (Unconditionally False) 
Short Immediate Value. May contain: 
quick: Signed 4-bit value, in MOVQ, ADDQ, 
CMPQ, ACB. 

. Format 1 
cond: Condition Code (above), in Scond. BSR ~0000 ENTER 
areg: CPU Dedicated Register, in LPR, SPR. RET —~0001 EXIT 

0000 = US CXP —0010 NOP 
0001 — 0111 = (Reserved) RXP —0011 WAIT 
1000 = FP RETT —0100 

1001 = SP RETI —0101 

1010 = SB SAVE —0110 

1011 = (Reserved) RESTORE —0111 

1100 = (Reserved) 
1101 = PSR 

1110 = INTBASE 
1111 = MOD Format 2 


(BR) 





— 000 ACB 
—001 MOVQ 
—010 LPR 
—011 
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Appendix A: Instruction Formats (continued) 





Format 3 
CXPD —0000 ADJSP —1010 
BICPSR —0010 JSR —1100 
JUMP —0100 CASE —1110 
BISPSR —0110 


Trap (UND) on XXX1, 1000 





Format 4 
ADD —0000 SUB — 1000 
CMP —0001 ADDR —1001 
BIC —0010 AND — 1010 
ADDC —0100 SUBC —1100 
MOV —0101 TBIT —1101 
OR —0110 XOR —1110 





16/15 
Format 5 
MOVS —0000 SETCFG —0010 
CMPS —0001 SKPS —0011 


Trap (UND) on 1XXX, 01XX 





16115 
Format 6 
ROT —0000 NEG — 1000 
ASH ~0001 NOT —1001 
CBIT —0010 Trap (UND) —1010 
CBITI —0011 SUBP —1011 
Trap (UND) —0100 ABS —1100 
LSH —0101 COM —1101 
SBIT -0110 IBIT —1110 
SBITI —0111 ADDP —1111 


92335 





8/7 0 
oe filrroorsre 


Format 7 
MOVM — 0000 MUL — 1000 
CMPM ~0001 MEI — 1001 
INSS —0010 Trap (UND) —1010 
EXTS —0011 DEI —1014 
MOVXBW —0100 QUO —1100 
MOVZBW —0101 REM —1101 
MOVZiD —0110 MOD —1110 
MOVXiID —0111 DIV —1111 
23 16115 8|7 i) 
gen 1 gen 2 reg i 101110 
op 
TL/EE/5054-72 
Format 8 

EXT —000 INDEX — 100 
CVTP —001 FFS —101 
INS —010 

CHECK —011 

MOVSU —110, reg=001 

MOVUS —110, reg=011 





16 l45 
Format 9 
MOVif —000 ROUND —100 
LFSR —001 TRUNC —101 
MOVLF —010 SFSR —110 
MOVFL —011 FLOOR —111 
7 0 
TL/EE/5054-37 
Format 10 


Trap (UND) Always 
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Appendix A: Instruction Formats (Continued) 


Operation Word 


Format 11 


Format 15 
ADDf —0000 DIVE —1000 (Custom Slave) 


MOVf —0001 Trap (SLAVE) —1001 
CMPf —0010 Trap (UND) —1010 Operation Word Format 
Trap (SLAVE) —0011 Trap (UND) —1011 
SUBf —0100 MULf —1100 
NEGf —0101 ABSf —1101 
Trap (UND) —0110 Trap (UND) —1110 
Trap (UND) —O111 Trap (UND) —1111 


Format 15.0 
CATSTO ~—0000 LCR 
CATST1 —0001 SCR 


TL/EE/5054-75 Trap (UND) on all others 


Format 12 23 16/15 


8 
Trap (UND) Always [gent | gonz | on fol i | 


Format 15.1 


TL/EE/5054-~76 — 000 CCV2 

—001 CCV1 

Format 13 —010 SCSR 

Trap (UND) Always —011 CCVvo 


Format 14 Format 15.5 


RDVAL —0000 LMR CCALO —0000 CCAL3 

WRVAL —0001 SMR CMOVO —0001 CMOV3 

Trap (UND) on 01XX, 1XXX CCMPO —0010 Trap (UND) 
CCMP1 —0011 Trap (UND) 
CCAL1 —0100 CCAL2 
CMOV2 —0101 CMOV1 
Trap (UND) —0110 Trap (UND) 
Trap (UND) —0111 Trap (UND) 


If nnn = 010, 011, 100, 110, 111 
then Trap (UND) Always 
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Appendix A: Instruction Formats (continued) | 





7 0 
_ TL/EE/5054-77 
Format 16 
Trap (UND) Always 
: TL/EE/5054-78 
Format 17 
Trap (UND) Always 
7 o 
TL/EE/5054-79 
Format 18 


Trap (UND) Always 


_ 
°o 


TL/EE/5054-—80 
Format 19 
Trap (UND) Always 
implied Immediate Encodings: 
7 0 


7 6 6S 4 8 wf nt 
Register Mask, appended to SAVE, ENTER 


~l 
=] 


ro r1 r2 r3 r4 r5 ré ‘7 


Register Mask, appended to RESTORE, EXIT 


~“ 


offset length—1 


Otffset/Length Modifier appended to INSS, EXTS 
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8EE-6 


RESET 
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FIGURE B-1. System Connection Diagram 
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